ABSTRACT

The research objectives were to overlay ancillary map data onto a Landsat multispectral data base, create
and test land-use change prediction models, and map land uses in the Denver Metropolitan Area.
Linear-discriminant analysis was used to model spatially projected land-use changes and to classify the
single-date Landsat-1 image with ancillary data inputs.

A landscape model was constructed with 34 land-use, physiographic, socioeconomic, and transportation
maps. A simple Markov land-use trend model was constructed from observed rates of change and nonchange
from photointerpreted 1963 and 1970 airphotos. Seven multivariate land-use projection models
predicting 1970 spatial land-use changes achieved accuracies from 42 to 57 percent. A final modeling
strategy was designed, which combines both Markov trend and multivariate spatial projection processes.

Landsat-1 image preprocessing included geometric rectification/resampling, spectral-band, and
band/insolation ratioing operations. Rectangular training-set selection gave biased and unreproducible
results.

A new, systematic grid-sampled point training-set approach proved to be useful when tested on the four
original MSS bands, ten image bands and ratios, and all 48 image and map variables (less land use).
Ten-variable accuracy was raised over 15 percentage points, from 38.4 to 53.9 percent, with the use of the 31
ancillary variables. Only three optimal ancillary variables added almost 12 of these 15 percentage points
of improvement. The maximum-likelihood ratio classifier was also established to be inferior to
linear-discriminant analysis for land-use mapping.

A land-use classification map was produced with an optimal ten-channel subset of four image bands and
six ancillary map variables. Point-by-point verification of 331,776 points against a 1972/1973 U.S.
Geological Survey (USGS) land-use map prepared with airphotos and the same classification scheme
showed average first-, second-, and third-order accuracies of 76.3, 58.4, and 33.0 percent, respectively.


The average direct development cost for this 24-class land-use mapping effort with the 48-variable
overlaid Landsat spectral/spatial map data was $0.0468 per hectare ($0.0189 per acre), or $704.19 per
1:24,000-scale USGS quadrangle, with the direct production cost using only the optimal ten-channel
spectral/spatial data set estimated at an average of $0.0317 per hectare ($0.0128 per acre), or $477.17
per 7.5-minute USGS quadrangle.


Recommendations for further evaluation study include densification of socioeconomic data, improved
digitization of topographic elevation, input of collateral soils data, multidate land-use change detection,
greater use of ancillary map data, and further information systems development.


*A portion of the work was performed while serving as a Senior Post-doctoral Research Associate, Goddard Space Flight Center.
CHAPTER 1


INTRODUCTION 


GENERAL 

Planning is a process that systematically deals with the following (Reference 1):

• Formulation of objectives and standards with which to specify future conditions 


• Collection and analysis of relevant data to accurately determine existing conditions and trends


• Development of alternative plans to achieve the desired conditions 


• Selection, adoption, and implementation of an “optimal” plan from the available alternatives


Land-use planning, the name commonly applied to the tasks of designing a unified development plan, must
consider the spatial arrangements of all land uses. Such land-use plans range from the location of
transportation routes to the location of public facilities such as airports, dams, open space, parks, power
plants, and schools.


The increasing size and complexity characterizing the modern, large-scale land-planning organization has
made the historical managerial functions of planning, organizing, and controlling the future spatial
distribution of land use much more difficult to achieve. At the same time, the successful application of these
functions has become increasingly essential to the orderly development and stability of today's urban and
rural communities.

Rapid growth pressures on the lands used for agriculture, mineral resources, open space, outdoor recreation,
water supply and storage, transportation, and wildlands require substantial advances in the collection,
classification, and availability of appropriate land inventory and planning data in general.


The required inventory and planning data are not generally available to land planners at any level. When
available, these data often lack completeness, quality, and timeliness. The availability of timely data in the
form of a detailed, quantitative inventory and planning data bank provides land planners with a dynamic
definition of the tradeoffs involved in evaluating the following:

• Alternative sites for the same type of development 

• Proposed alternative sites to be earmarked for development

• Reciprocal effects between development/nondevelopment and activities in adjacent areas 




Planning Process

Urban planning has traditionally emphasized efficient functioning of cities in a purely economic and
engineering sense, with particular attention devoted to land-use activities, transportation, and zoning. This
emphasis has been termed “physical” planning and seeks to achieve a viable and sound land-use pattern
(Reference 2). A stress on “current” planning permits only a land regulation-type of urban planning in
which subdivision codes, zoning, and parcel-oriented policy-making are paramount. “Advance” planning
addresses the longer range, but equally urgent, problems such as blight, obsolescence, suburbanization,
and sprawl and may be done both intermittently and incrementally. Physical planning works as an agent
of private development, rather than as a normative instrument for the public welfare. It does not challenge
the 19th century view of land as a speculative commodity (Reference 3), but merely imposes ground rules
under which speculation continues.

Promoted through the Housing Act and other recent federal legislation, social planning adopts a
socioeconomic, political, and physical approach to the development and functioning of the urban community in an
effort to compensate for the past failures of physical planning and to address major issues and problems
confronting the hitherto ignored human and social elements. Historically, reconciliation of the two
philosophies has been unsuccessful.

Regional growth has been accomplished through near-universal phenomena of air, land, and water pollution,
undesirable economic contingencies and land-use conversions, traffic congestion, and urban blight and sprawl.
Although urban land uses command the greatest value in monetary terms, the “value” of open, nonurban
areas remains uncontested and essential. This open space is being progressively preempted by other land uses
because of a myriad of factors inherent in the employment of current planning practices. Therefore, it is
both necessary and valuable to inventory large urban and urban fringe areas and to delimit land uses and
potentials.

It is essential to first develop an understanding of the landscape in terms of its cultural and natural components.
These are difficult to measure because of the complex interactions among air, water, land, biomass, and 
cultural resources. In addition, these landscape elements are always in a dynamic state of succession, how- 
ever slow or fast that successional rate may be. 



Needed Technology Transfer

Local and regional planning agencies represent a large number of potential users of the evolving remote
sensing technology. Remote sensing may be defined as the noncontact collection, analysis, and interpretation
of data from aerospace platforms. It should be realized that this remote sensing extends far beyond the
realm of conventional aerial photography, which is only a small, representative facet of the increasingly
expanding and sophisticated whole. Spatial remote sensing data are available over large areas and can already
be cost-effectively obtained relative to traditional ground methods. Remote sensing data can be collected,
interpreted for various purposes, stored for further use, and used as a historical record. They can also be
employed more easily, consistently, and objectively, and at lower unit-area cost than most ground-based surveys.

Remote sensing offers considerable potential in advance planning procedures in which unique advantages
are realized in dealing with environmental issues in which policy-making must be based on both quantitative
and qualitative data. It permits for the first time the practical establishment of standardized observations of,
and criteria for, change. Observations at appropriate time intervals will reveal the precursors of change and
trends sufficiently early that land planners can probe for basic causes and assess latent probabilities.
Continuous planning at regional scales becomes practical. Lastly, remote sensing presents a permanent
record of urban phenomena unbiased by the planner's experience.

These advantages of remote sensing suggest practical applications in a different form than is currently
practiced. The rising tide of environmental concern popularly manifested in the 1970's is exerting great
pressure on “front-line” planning agencies to shed their historical burden of current administrative problems.
They must now assume greater responsibilities for the quality of life, environmental quality, and other
long-neglected urban problems. Because environmental deterioration and destruction can be traced to man's
interference, there are increasing pressures to monitor, measure, and evaluate his activities. Surrogates for
environmental indicators have been proposed, including the analysis of disparate and preemptive land uses
(References 4 and 5), changes in land value (Reference 6), and the quality and care of residential lawns
(Reference 7). While the greatest current demand on remote sensing is to generate land-use data, the technical
capability simultaneously exists to extract quantitative and qualitative data of which land use is but a part.

Unfortunately, there has been relatively limited success as yet in incorporating remote sensing applications
into the routine operations of urban and regional planners and decision-makers. The mechanics for
accomplishing such a technology transfer are simply not known. The scarcity of such planning applications is not
apparently related to either the quality or utility of remote sensing techniques, nor even simple ignorance.

One major obstacle in realizing the full potential of remote sensing is that its form has been incompatible
so far with the census tracts, blocks, and parcels and other land-use inventory schemes used by planners.
While remote sensing data can be interpreted to yield both spatial and statistical parameters, these data must
also be aggregated and interfaced, for the time being, to the areal units that are compatible with the planners'
normal geographical definitions of the city or region.

A second aspect of the technology transfer problem is the necessity to create and maintain large geographic
data bases over extensive areas. The various available data input sources are rectified to a common geographic
base to meet social planning requirements. Basic social land-use planning concepts have not yet been widely
researched or implemented. However, recent developments in electronic data processing have created the
possibility that the voluminous quantity of available social planning data may be manipulated more easily
and economically in the form of a geographic information system. Remote sensing can provide wide, regional
coverage of inaccessible or sparsely monitored geographic data. Thus, the computer-based geographic
information system can, in turn, effectively blend regional planning and remote sensing to simplify both physical
and social planning data requirements and actions. Such data collection, retrieval, and analysis capabilities
are essential for the monitoring, measurement, and evaluation of land-use changes before planners can
undertake environmental planning and thereby assume large-scale environmental management responsibilities.

Currently, remote sensing and geographic information systems remain as intermittent, piecemeal grafts on
a persistently manual methodology of operational land-use analysis that is largely cosmetic in nature.

STUDY OBJECTIVES 


The objectives of this specific research were to overlay ancillary map data onto a Landsat multispectral data
base, create and test various models for the prediction of land-use change, and map land uses in the Denver
Metropolitan Area. Specific objectives included the following:

• Development of a landscape model with land-use, physiographic, socioeconomic, and
transportation components

• Testing of a land-use trend model

• Development and testing of a spatial land-use projection model

• Overlaying of the Landsat image onto the landscape model

• Optimization of the Landsat image classification algorithm when used with landscape
ancillary variables

• Optimal land-use mapping of the study area using Landsat and landscape variables

• Display and verification of the machine-produced maps

• Tabulation of computer, labor, and material cost/time for the land-use maps

SCOPE OF THE RESEARCH 

Landsat-1, formerly the Earth Resources Technology Satellite (ERTS-1), has provided unexcelled
opportunities to explore the utility of aerospace remote sensing data for large-scale analyses of land ecosystems.
This research was specifically directed toward the extraction, use, and assessment of land-use inventory and
planning data derived from such remote sensing imagery, ancillary map data, and geographic information
systems. This endeavor sought to illustrate which features and characteristics of these data inputs and
systems are useful in planning processes.

The research was also structured to provide an analytical framework for modeling land-use changes and for
identifying the factors that were influential in controlling such changes. The results indicated the level of
accuracy to be expected under operational conditions and also provided the recommended procedures,
data sources, and computational techniques.

GENERAL APPROACH 

The study organized and overlaid data from existing maps, census tables, and remote
sensing imagery into a computer framework (figure 1). This assemblage provided a multivariate [...]
that represented the landscape much as a three-dimensional model of the
physical terrain is represented by a topographic map (Reference 8). Coupled with this composite of
[...] simulations of the
spatial or map-like behavior of this landscape to either natural or man-induced alteration and control
(Reference 9). The thrust of land-use modeling was the prediction and display in map form of the future
landscape that would result from the continuation of current land-management practices or the lack [...]
INPUTS:

TOPOGRAPHIC MAPS
VEGETATION MAPS
SOILS MAPS
GEOLOGIC MAPS
TRANSPORTATION MAPS
CENSUS TRACT MAPS OF POPULATION, FAMILY COUNT, HOUSING, AND CAR OWNERSHIP
SATELLITE IMAGES (DIGITAL, OPTICAL)
AIRCRAFT IMAGES
PHOTOINTERPRETATION
COMPUTER PATTERN CLASSIFICATION

COMPUTES:

SLOPE AND ASPECT PLANES
SOCIOECONOMIC DENSITY PLANES
MINIMUM-DISTANCE PLANES
TRANSPORTATION ACCESS PLANES
SOLAR-RADIATION PLANES
LANDSAT TRANSFORMATIONS

OUTPUTS:

SITE DEVELOPMENT PLANS
COMPARISONS
ZONING ALTERNATIVES
MANY OTHER APPLICATIONS TO PLANNING, ETC.




[...] objective provided a basis for the proposal for the improvement of these techniques
to predict how the landscape will evolve in a spatial sense in response to various scenarios of anticipated
alterations [...]. Use of remote sensing imagery provided the important current and past land-use inputs to the
land-use modeling and was totally compatible with the modeling process. Symbiotically, [...]
substantially improved by including landscape [...]. Thus, combining the available remote sensing imagery
with map data in the digital landscape model provided the basis for substantial improvements in both activities.





[...] and for satisfying the [...] resources for testing the basic hypotheses. [...] of low-altitude
black-and-white airphotos provided the means of [...] quantitative measures of change [...].

The U.S. Geological Survey (USGS) Circular 671 (Reference 10) hierarchical land-use classification
scheme [...] in both manual and automated interpretations [...]. Twenty-one second-order USGS land uses
were originally photointerpreted for [...] spatial land-use projection. However, 13 second-order and
11 third-order categories were [...] land-use classification to conform to the classes used in a 1972-1973
USGS photointerpreted [...] Metropolitan Area (Reference 11). Although USGS Circular [...] Paper 964
(Reference 12), revision was not made here because a large amount of [...] (photointerpretations, etc.)
had already been completed using the Circular 671 approach.

[...] changes in remote [...] to have been proposed in 1972 [...] tested by the USGS Geographic [...]
research programs that monitored land-use changes using satellite and high-altitude [...] that used the
scheme [...] the Ozarks region, and [...] tested in each of these regions [...].

ANALYSIS PROCEDURES 

[...] landscape data. These diverse sources and quantities of input data were [...] manual data collection
and analysis methodologies. It should be [...] quantities of map and remote sensing data can serve no
practical purpose unless both [...] processing. Therefore, the data analysis plan was [...] progression of
steps needed for accomplishing the study objectives [...] the bracketed numbers [...].

Data Resources 

[...] (Landsat scene [...]) for the Denver [...] a high-quality summer image [...]
TABLE 1

[...] Denver metro[...] (Reference 10). This standardized classification was used for manual airphoto
interpretation and [...] third-order classes for automated Landsat image analysis. The system [...]
revised [...] (Reference 12), with minor changes [...].

Digital
Code      First-Order Land Use/Land Cover
             Second-Order
                Third-Order

          Urban and built-up land
11           Residential
12           Commercial and services
121             Recreational
13           Industrial
14           Extractive
15           Transportation, communications, and utilities
151             Utilities
16           Institutional
17           Strip and clustered development
18           Mixed urban
19           Open and other urban
191             Solid-waste dump
192             Cemetery

          Agricultural land
21           Cropland and pasture
211             Nonirrigated cropland
212             Irrigated cropland
213             Pasture
22           Orchards, groves, and other horticultural areas
23           Feeding operations
24           Other agricultural land

          Rangeland
31           Grass
32*          Savannas
33           Chaparral (taken as brushland)
34*          Desert shrub

          Forest land
41           Deciduous
411             Deciduous/intermittent crown
42           Evergreen (coniferous and other)
421             Coniferous/solid crown
422             Coniferous/intermittent crown
43*          Mixed forest land

          Water
51           Streams and waterways
52           Lakes
53           Reservoirs
54*          Bays and estuaries
55           Other water

          Nonforested wetland
61           Vegetated
62*          Bare

          Barren land
71*          Salt flats
72*          Beaches
73           Sand other than beaches
74           Bare exposed rock
[...]           Hillslopes
75           Other barren land

          Tundra
81*          Tundra

          Permanent snow and icefields
91*          Permanent snow and icefields

*Land-use/land-cover type not found in the Denver Metropolitan Area.
Cultural landscape data included census tract-aggregated socioeconomic data for 1970 [2] and a 197[?]
State Highway Department street/highway classification map with eight road categories [3].

Physiographic landscape data consisted of a USGS 1:62,500-scale surficial geology map [4], sixteen
1:24,000-scale topographic maps [5], and computer-generated slope [7], aspect [8], and insolation [10]
data planes.

[...] resources [12] included the four basic multispectral scanner (MSS) images of a single-date
1973 scene, the six MSS image-to-image ratios, and four MSS/insolation-normalized ratios [11].

Analysis Procedures 

The modeling activity was structured to analyze the interactions between land-use changes and the regulating
physiographic, socioeconomic, and transportation components of the landscape. This involved the
compilation, rectification, and generation of spatially registered data planes for the various landscape elements.

Each landscape data plane represented a spatially distributed, single-variable map for the 39- by 39-km
(24- by 24-statute mile) or 1491-km² (576-mi²) study area centered on the Denver Metropolitan Area. The
majority of these data planes were obtained by thematically sampling and computationally transforming the
available maps using a fixed 192-row by 192-column dot grid yielding 36,864 cells of 4 hectares (10 acres)
per plane. These planes, using the popular grid-cell technique, were accessed in overlaying operations that
combined single-variable data planes with conformable spatial dimensions.
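
The grid arithmetic quoted above can be checked with a short sketch. The function and variable names here are illustrative assumptions, not taken from the study's software, and the overlay function only shows the general grid-cell idea of stacking conformable planes into one vector per cell.

```python
# Grid geometry quoted in the text; all names are hypothetical.
ROWS, COLS = 192, 192
SIDE_MILES = 24.0
KM_PER_MILE = 1.609344


def grid_summary(rows=ROWS, cols=COLS, side_miles=SIDE_MILES):
    """Return (cell count, study-area km^2, hectares per cell)."""
    side_km = side_miles * KM_PER_MILE
    area_km2 = side_km ** 2
    cells = rows * cols
    cell_ha = area_km2 * 100.0 / cells  # 1 km^2 = 100 hectares
    return cells, area_km2, cell_ha


def overlay(*planes):
    """Stack conformable single-variable planes into one tuple per cell."""
    rows, cols = len(planes[0]), len(planes[0][0])
    for plane in planes:
        if len(plane) != rows or any(len(r) != cols for r in plane):
            raise ValueError("data planes are not conformable")
    return [[tuple(p[r][c] for p in planes) for c in range(cols)]
            for r in range(rows)]


cells, area_km2, cell_ha = grid_summary()
print(cells, area_km2, cell_ha)
```

Each cell side works out to one-eighth of a mile, which is why the cell area comes to about 4 hectares (10 acres).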


Ian McHarg's regional landscape analysis method is a well-known manual method of such overlaying processes
(Reference 15). This approach physically overlays colored transparent acetate-plotted variables to view
spatial correlations and juxtapositions as composite colors; however, as more variables are considered, the
display becomes confusing, and the approach becomes less useful. The overlay process used in this research
was carefully designed after a thorough review of the available systems (Reference 9). Implemented as a
software [...]

Linear discriminant analysis was selected as the computational vehicle for implementing the dual land-use
predictive modeling [14] and inventory mapping [15] capabilities. This dual use represented a new,
multivariate statistical approach to land-use mapping. Discriminant analysis permitted each landscape cell to be
represented as a point in a multidimensional, statistical framework in which each observation was a measured
[...] observations of land use or change in land use were used to determine orthogonal
axes (discriminant functions) to a possible total of one less than the number of land-use variables represented.
Each new observation can be plotted and identified by means of a single discriminant score.

The mapping with Landsat MSS data or modeling future land-use patterns with ancillary map-format data
required the computation of representative statistical signatures for each land use or land-use change of
interest. Subsequently, each unknown individual land use, land-use change element, or landscape cell was
classified according to the closest multivariate match with the nearest numeric signature subset or class.
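
The signature-and-match idea can be sketched as follows. This is a minimal illustration, assuming two variables per cell, equal prior probabilities, and a pooled within-class covariance. The training values, class labels, and function names are invented for the example and are not the study's data or the LMS code.

```python
def mean_vec(rows):
    """Mean of each variable over the training cells of one class."""
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(len(rows[0]))]


def pooled_cov_2d(training):
    """Pooled within-class covariance matrix for two variables."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    dof = 0
    for rows in training.values():
        m = mean_vec(rows)
        for r in rows:
            d = (r[0] - m[0], r[1] - m[1])
            for i in range(2):
                for j in range(2):
                    s[i][j] += d[i] * d[j]
        dof += len(rows) - 1
    return [[s[i][j] / dof for j in range(2)] for i in range(2)]


def inv_2x2(a):
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]


def fit(training):
    """One linear classification function (weights, constant) per class."""
    s_inv = inv_2x2(pooled_cov_2d(training))
    model = {}
    for label, rows in training.items():
        m = mean_vec(rows)
        w = [s_inv[0][0] * m[0] + s_inv[0][1] * m[1],
             s_inv[1][0] * m[0] + s_inv[1][1] * m[1]]
        b = -0.5 * (w[0] * m[0] + w[1] * m[1])
        model[label] = (w, b)
    return model


def classify(model, x):
    """Assign the class whose linear score is largest for cell vector x."""
    return max(model, key=lambda k: model[k][0][0] * x[0]
               + model[k][0][1] * x[1] + model[k][1])


# Hypothetical training cells: (image band value, slope in percent).
training = {
    "residential": [(30, 2), (32, 3), (31, 1), (29, 2)],
    "forest": [(15, 20), (17, 25), (14, 22), (16, 18)],
    "water": [(5, 0), (6, 1), (4, 0), (5, 1)],
}
model = fit(training)
print(classify(model, (31, 2)), classify(model, (15, 21)))
```

With equal priors, choosing the largest linear score is equivalent to choosing the class mean that is closest in the pooled-covariance (Mahalanobis) sense, which is one reading of "closest multivariate match."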
Similarly, a landscape cell within this inventory/modeling system consisted of a series of spectral responses
from MSS data and/or a statistical measure of several types of ancillary data from a given area within the
Denver study area. This response was a vector of size equal to the total number of spectral bands and
transformations and/or ancillary map variables. Both forms of data were interleaved to represent this vector on
digital tapes in proper geometric relationship. Thus, the mode of analysis could be specified by an
appropriate selection of image, land-use, socioeconomic, physiographic, or transportation variables.

Discriminant analysis was performed in a stepwise fashion (i.e., by entering one additional variable or
attribute into the set of discriminating variables for each successive iteration). The variable to be added, but not
already included in a previous iteration, was selected for inclusion on the basis of greatest F-value. Thus,
the analysis operated in a “supervised” fashion on user-specified representative training sets of known
land-use composition to: (1) evaluate the statistical utility of each variable as hierarchically determined by
discriminant analysis and (2) quantify the overall and incremental land-use mapping/modeling accuracies for [...]
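
The first step of such a stepwise procedure can be sketched as below. This is a simplified illustration that ranks candidate variables by their one-way analysis-of-variance F ratio across classes. Later steps in a full stepwise routine use F values conditioned on the variables already entered, which this sketch does not attempt. The variable names and training values are hypothetical.

```python
def f_ratio(groups):
    """One-way ANOVA F ratio for one variable measured in several classes."""
    values = [v for g in groups for v in g]
    n, k = len(values), len(groups)
    grand = sum(values) / n
    between = sum(len(g) * (sum(g) / len(g) - grand) ** 2 for g in groups)
    within = sum((v - sum(g) / len(g)) ** 2 for g in groups for v in g)
    return (between / (k - 1)) / (within / (n - k))


def first_variable_to_enter(training, names):
    """Return the variable with the greatest F ratio, plus all scores."""
    scores = {}
    for j, name in enumerate(names):
        groups = [[row[j] for row in rows] for rows in training.values()]
        scores[name] = f_ratio(groups)
    return max(scores, key=scores.get), scores


# Hypothetical training cells: (red-band value, elevation in meters).
training = {
    "class_a": [(10, 100), (12, 300), (11, 200)],
    "class_b": [(30, 150), (32, 250), (31, 200)],
}
best, scores = first_variable_to_enter(training, ["mss5", "elevation"])
print(best, scores)
```

In this toy data the two classes have the same mean elevation, so elevation carries no separating power and the image band enters first.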


Utilization of Computer Program Modules

When possible, available operational computer software was used to bypass potential “programming
bottlenecks” in handling the large quantities of imagery and map data associated with the study.

The TOPOMAP program (References 16 and 17) created a digital terrain model [6] with the elevation data
to compute topographic slope [7] and aspect [8] data planes. These topographic slope and aspect data
planes were input to a digital spectroirradiance model (program INSOL2) [9] to compute a near-instantaneous
incoming solar radiation or insolation data plane [10].
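
How slope and aspect planes can be derived from an elevation grid is sketched below. This is a generic central-difference illustration, not the TOPOMAP algorithm, whose internals are not described here. It assumes that row 0 is the northern edge, that columns run west to east, and that cells are square with a 200-m side (roughly a 4-hectare cell). All names are hypothetical.

```python
import math


def slope_aspect(z, cell_size):
    """Return (slope in percent, aspect in compass degrees) planes.

    Aspect is the downslope direction, measured clockwise from north.
    Edge cells and flat cells are left as None.
    """
    rows, cols = len(z), len(z[0])
    slope = [[None] * cols for _ in range(rows)]
    aspect = [[None] * cols for _ in range(rows)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            dz_east = (z[r][c + 1] - z[r][c - 1]) / (2.0 * cell_size)
            dz_north = (z[r - 1][c] - z[r + 1][c]) / (2.0 * cell_size)
            slope[r][c] = 100.0 * math.hypot(dz_east, dz_north)
            if dz_east != 0 or dz_north != 0:
                # Downslope vector is the negative gradient (east, north).
                azimuth = math.degrees(math.atan2(-dz_east, -dz_north))
                aspect[r][c] = azimuth % 360.0
    return slope, aspect


# Hypothetical terrain rising 10 m per cell toward the east.
elevation = [[10.0 * c for c in range(4)] for _ in range(4)]
slope, aspect = slope_aspect(elevation, 200.0)
print(slope[1][1], aspect[1][1])
```

A surface that rises toward the east faces west, so the interior cells of this toy grid have a 5-percent slope and an aspect of 270 degrees.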

Program TRANSF2 from the Colorado State University (CSU) Landsat mapping system (LMS) (Reference 18)
was used to create the six MSS ratios and four MSS/insolation ratios [11]. Other Landsat-1 preprocessing
operations performed by the LMS package included data reformatting and tape merging [12].
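
The count of six image-to-image ratios follows from taking the four MSS bands two at a time. The sketch below illustrates this under stated assumptions: the direction of each ratio (lower band over higher band), the variable labels, and the treatment of a zero denominator are choices made for the example, not details of TRANSF2.

```python
from itertools import combinations

BANDS = ("MSS4", "MSS5", "MSS6", "MSS7")


def safe_ratio(numerator, denominator):
    """Return the ratio, or None where the denominator is zero."""
    return numerator / denominator if denominator else None


def band_ratios(pixel):
    """Six pairwise band ratios for one pixel (dict of band values)."""
    return {f"{a}/{b}": safe_ratio(pixel[a], pixel[b])
            for a, b in combinations(BANDS, 2)}


def insolation_ratios(pixel, insolation):
    """Four band values normalized by the cell's insolation value."""
    return {f"{b}/INSOL": safe_ratio(pixel[b], insolation) for b in BANDS}


# Hypothetical pixel values and insolation value.
pixel = {"MSS4": 20, "MSS5": 40, "MSS6": 50, "MSS7": 25}
variables = {**pixel, **band_ratios(pixel), **insolation_ratios(pixel, 0.8)}
print(sorted(variables))
```

The four bands, six band ratios, and four insolation ratios together give the 14 image-derived variables described in the text.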

Program PLANMAP (Reference 1), a geographic information system, compared the 1963 and 1970 land-use
data planes and produced urbanization rates for the various land-use classes to drive a Markov chain
successional trend model [13].
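
A Markov trend model of this kind can be sketched by cross-tabulating two co-registered land-use planes into transition probabilities and then applying them to the later class totals. This is a generic illustration with invented data and hypothetical function names, not the PLANMAP implementation. One projection step corresponds to the interval between the two input dates.

```python
from collections import Counter, defaultdict


def transition_matrix(plane_t0, plane_t1):
    """Row-normalized transition probabilities between two dates."""
    counts = defaultdict(Counter)
    for row0, row1 in zip(plane_t0, plane_t1):
        for before, after in zip(row0, row1):
            counts[before][after] += 1
    return {a: {b: n / sum(c.values()) for b, n in c.items()}
            for a, c in counts.items()}


def project(totals, matrix):
    """Apply one Markov step to class totals (cells per class)."""
    out = Counter()
    for a, n in totals.items():
        # Classes never seen at the first date are assumed to persist.
        for b, p in matrix.get(a, {a: 1.0}).items():
            out[b] += n * p
    return dict(out)


# Hypothetical 2-by-2 planes: 21 = cropland and pasture, 11 = residential.
plane_1963 = [[21, 21], [21, 11]]
plane_1970 = [[21, 11], [11, 11]]
matrix = transition_matrix(plane_1963, plane_1970)
totals_1970 = {21: 1, 11: 3}
projected = project(totals_1970, matrix)
print(matrix, projected)
```

The projection gives class totals only. It says nothing about where changes occur, which is the gap the spatial projection models are meant to fill.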

Automated image classification with and without ancillary data access [15] was tested with the channel
selection feature of the EXTRACT and CLASSIFY linear discriminant analysis programs in LMS. CLASSIFY
represented an adaptation of the BMD07M stepwise multidiscriminant analysis routine in the biomedical
design (BMD) program series (Reference 19). This stepwise approach also permitted the selection of an
optimal subset of image and map variables for a full-image classification [16].

The simple tabulation programs, CHECKER1 and CHECKER2, were written to facilitate a point-to-point
comparison of the classification map file to the 1972-1973 USGS land-use reference plane for the multilevel
tabulation of land-use mapping accuracy [17].
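
One way to tabulate point-to-point agreement at several levels of a hierarchical code is sketched below. It assumes that agreement at a given order means the leading digits of the classified and reference codes match, which is an interpretation made for this example and not a description of CHECKER1 or CHECKER2. The code values are invented.

```python
def agreement(classified, reference, digits):
    """Percent of cells whose codes agree in their first `digits` digits."""
    hits = total = 0
    for row_c, row_r in zip(classified, reference):
        for code_c, code_r in zip(row_c, row_r):
            total += 1
            hits += str(code_c)[:digits] == str(code_r)[:digits]
    return 100.0 * hits / total


# Hypothetical 2-by-2 classification and reference planes.
classified = [[11, 121], [211, 41]]
reference = [[12, 121], [212, 31]]
first = agreement(classified, reference, 1)
second = agreement(classified, reference, 2)
third = agreement(classified, reference, 3)
print(first, second, third)
```

Accuracy can only stay level or fall as more digits are compared, which matches the ordering of the first-, second-, and third-order accuracies reported in the abstract.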

Finally, extended development and interrelated use of these land-use inventory and modeling programs
indicated that it was possible to develop even more sophisticated models to assess the limitations and impacts
from, or response to, various land-use scenarios [18].
DETAILED REPORT ORGANIZATION

Chapter 1, "Introduction," describes the study and provides general comments on land-use planning,
background and rationale, objectives, scope, approach, and the analysis plan.

Chapter 2, "Landscape Model Construction," presents the map sampling and compilation methodology
involved in constructing the land-use, physiographic, transportation, and socioeconomic map data planes. 

Chapter 3, "Spatial Land-Use Projection," describes the spatial land-use projection models, testing, and
results, and outlines a revised spatial-change modeling strategy that combines both the Markov trend and
discriminant analysis models.

Chapter 4, "Landsat Land-Use Classification," explains the Landsat land-use classification effort. It deals
with image rectification and resampling, ancillary map data overlaying, feature extraction sampling methods
and verification, and classification algorithm selection and testing. It also gives the costs of developing the
data and optimal approach. Finally, this approach is applied to the total study area in a cost-effective format,
and the results achieved are displayed and verified.


Chapter 5, "Conclusions," summarizes the basic endeavors, activities, and results of the study. Applications
areas are identified, and various recommendations are made. The recommendations generally concern ancil- 
lary data inputs and machine interpretation and processing. 

Five appendixes support the main text: 

• Appendix A, "Multiple Discriminant Analysis," presents a brief overview of the multivariate
statistical technique of linear-discriminant analysis, used in this study for both spatial land-use
projection and Landsat land-use classification.

• Appendix B, "Landsat Mapping System" (LMS) describes the set of programs making up the 
Landsat Mapping System that has been implemented at Colorado State University. 

• Appendix C, "Machine Classification Error-Rate Estimation," tabulates omission/commission rates
for the first- and composite second- and third-order USGS land-use classes.

• Appendix D, "Maximum-Likelihood Ratio," describes the widely used multivariate classification
algorithm tested against linear-discriminant analysis in this study.

• Appendix E, "Correlation Matrices for Rectangular/Point-Sampled Training Sets," presents these 
statistics. 
CHAPTER 2


LANDSCAPE MODEL CONSTRUCTION 


INTRODUCTION 

The evolution of land use in the United States is one of the most perplexing problems confronting our
society. It involves a tangled web of biological, economic, environmental, institutional, physical, and
political interactions. The land conversions that result can be studied in terms of aesthetics, aging, birth rates,
climate, crowding, economics, industry, legislation, mineral extraction, mortality, public services, social
amenities, technology, topography, transportation, vegetation, water, and zoning.

It was concluded that the most fruitful approach to understanding the evolution of urban land use was by
the construction and manipulation of a landscape model of the Greater Denver Metropolitan Area. This
landscape model provided a better understanding of Denver's physical environment from a joint analysis
of the various remote sensing images, collateral landscape maps and data (e.g., land-use, physiographic,
socioeconomic, and transportation maps), and a literature survey for the historical period under study. These
collateral data stored in the landscape model provided the data base for the analysis necessary for understanding
the present and recent past patterns of land-use evolution in the Denver Metropolitan Area. The analysis also
yielded insights into the biological, economic, environmental, and social driving forces that have induced the
rural-to-urban land-use conversions observed. Most important, these collateral data were indispensable as both
spatial and statistical geographic inputs to the evaluation, analysis, and presentation of a quantitative landscape
model for predicting future spatial changes in Denver land use.

The cultural and natural landscape components of land use are not only undergoing a dynamic state of
succession, but are also subject to quite varied and complex driving forces. Accordingly, the landscape
modeling effort focused its time and resources on selecting ancillary data factors that were believed to be of
broad applicability, usefulness, and general availability for the intensively urbanized areas of the United
States. Thus, in this undertaking, special emphasis was placed on validating and utilizing meaningful and
widely available data input sources to the landscape modeling to allow replication by interested researchers
or planners for other metropolitan areas. Although some data sources will not be universally available
throughout the country, it was anticipated that very close, if not identical, surrogates would exist in the
majority of cases. Preference was always given to universally available data such as Department of Commerce
census data, USGS topographic and geologic maps, National Aeronautics and Space Administration (NASA)
satellite imagery, and historical sequences of airphotos in contrast to equally usable but more localized and
volatile data sources. Even these more universally available data have been collected with diverse techniques
and map scales by a wide assortment of public and private organizations. The landscape model provided
a depository for interrelating such diverse information in a common format. A model variable or data
plane is one overlay of spatially registered data in a common cellular network upon all other data planes
or variables in the landscape model (figure 3).
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ORIGINAL PAGE IS 
OF POOR QUALITY 


7 LAND-USE SUBMODEL VARIABLES: 
    1963 Photo Land Use 
    1970 Photo Land Use 
    1972 to 1973 USGS Photo Land Use 
    Two 1963 to 1970 Land-Use Changes 
    Two 1963 to 1970 Alphanumeric Coded Land-Use Changes 

9 PHYSIOGRAPHIC SUBMODEL VARIABLES: 
    Topographic Elevation 
    Topographic Slope 
    Topographic Aspect 
    USGS Surficial Geology 
    Solar Insolation for Landsat Imagery 
    Four Landsat MSS Insolation Ratios 

5 TRANSPORTATION SUBMODEL VARIABLES: 
    Composite Minor-Road MD 
    Composite Major-Road MD 
    Freeway MD 
    Freeway Interchange MD 
    Built-Up Urban-Area MD 

17 SOCIOECONOMIC SUBMODEL VARIABLES: 
    Four Population/Housing Densities Per Acre 
    Five Population/Family/Housing-Unit Totals 
    1969 Mean Family Income 
    Median Housing-Unit Rent/Value 
    One-/Two-/Three-Car Family Totals 
    Total Census Tract Acreage 
    Average Number of Cars Per Family 

10 LANDSAT IMAGE SUBMODEL VARIABLES: 
    MSS-4 (Visible Green) 
    MSS-5 (Visible Red) 
    MSS-6 (Solar Infrared) 
    MSS-7 (Solar Infrared) 

FIGURE 3. CONCEPTUAL DIAGRAM OF THE LANDSCAPE MODEL OF THE DENVER STUDY AREA. 
Available maps, spatially referenced tabular data, and remote sensing imagery were assembled, interpreted, 
[illegible], and overlaid into a cellular landscape model with 48 image/map variables. This landscape model 
provided the data base for understanding the present and recent past patterns of land-use evolution and their 
causal factors (MD = minimum distance). 
CONSTRUCTION OF A SPATIAL DATA PLANE 


At the outset of the design of the landscape model, a complete inventory was prepared of all available 
[illegible] imagery, ranging from the early low-altitude [illegible] the mid-1930's to current Landsat imagery 
(Reference 1). The data planes to be input to the landscape model were selected from this thorough inventory. 
New data were continually sought and evaluated as available data were found to be incomplete, incompatible, 
or inadequate for the landscape model. Caution was taken to ensure that not too much money and energy were 
expended on [illegible] that adequate resources remained for constructive analysis and modeling. [illegible] 
among those now using some form of landscape modeling, especially in smaller planning agencies, to think 
that assembling data planes in composite [illegible] constitutes planning and analysis, whereas it is in 
actuality only a necessary prerequisite. These planners and experimenters are currently enamored with 
sophisticated processes for putting huge [illegible] of the thought goes into the [illegible] a mere 5-percent, 
last-minute attempt is made to determine how to analyze it. This effort has selected the simplest means 
possible for overlaying each data plane onto the landscape model and has concentrated the bulk of its 
attention on analyzing the data planes. 

Map Sampling Procedure 

Potential landscape modeling variables were evaluated for general availability and significance. Thirty-four 
map variables were selected for inclusion in the landscape model and were divided into land-use, 
physiographic, socioeconomic, and transportation submodels (table 2). 

Some explanation of the two general types of variables that were overlaid on the model is in order here. 

A categorical variable denotes the presence or absence of a category or class. An observation of this type 
is mutually exclusive (that is, it can fall into only one distinct preselected class or category). Thus each 
land-use data plane has 24 specific classes, whereas the surficial geology variable in the physiographic 
submodel has 13 specific classes. A numerical variable refers to a purely digital sequence or range of 
continuously varying numbers. Observation values can be directly compared with one another in a mathematical 
sense. Topographic elevation and the minimum distance to minor roads are examples of numerical variables. 

The landscape model covered an area of 39 by 39 km (24 by 24 mi) or 1491 km² (576 mi²) centered on 
the city of Denver (figure 4). Most of this study site has relatively low relief except for about 5 percent of 
the area along the southwestern edge, which includes the eastern foothills of the Rocky Mountains. The 
Greater Denver Metropolitan Area is entirely contained within this area and includes Denver proper, which 
is a population center of approximately 1,500,000 people with future expectations for continued growth. 

The study area was divided into 192 north-south rows and 192 east-west columns for a total of 36,864 
4-hectare (10-acre) mapping cells. This cellularization permitted any spatially distributed two- or 
three-dimensional landscape variable to be rectified, sampled, encoded, stored, retrieved, and displayed by 
its implicit position in this predetermined computer-compatible grid network. Adopting this landscape 
modeling approach made it practical and feasible to construct and overlay geographical data planes of 
multiple landscape attributes for subsequent computer analysis, modeling, and land-use change predictions. 
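
The grid arithmetic of this cellularization can be checked with a minimal sketch. The constant and function names below are illustrative assumptions and are not taken from the original software; the sketch assumes that positions are measured in miles from the northwest corner of the study area and that each 10-acre cell is one-eighth of a mile on a side (64 cells per square mile). 

```python
import math

CELLS_PER_MILE = 8                               # 8 x 8 ten-acre cells per 640-acre square mile
SIDE_MILES = 24                                  # study area is 24 by 24 miles
N_ROWS = N_COLS = SIDE_MILES * CELLS_PER_MILE    # 192 rows and 192 columns


def cell_index(miles_east, miles_south):
    """Return (row, col) of the cell containing a point measured from the northwest corner."""
    row = math.floor(miles_south * CELLS_PER_MILE)
    col = math.floor(miles_east * CELLS_PER_MILE)
    if not (0 <= row < N_ROWS and 0 <= col < N_COLS):
        raise ValueError("point lies outside the study area")
    return row, col


def linear_index(row, col):
    """Return the implicit storage position of a cell in row-major order."""
    return row * N_COLS + col
```

Under these assumptions the grid holds 192 x 192 = 36,864 cells, so that a variable can be stored and retrieved by position alone, with no explicit coordinates attached to each value. 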
TABLE 2 

The 24- by 24-mile Denver Metropolitan Area was modeled [illegible]. These lists of ancillary variables are 
arranged into four functional submodels. A "categorical variable" is a data plane that represents map classes 
or categories with no implicit [illegible] selected to represent them. A "numerical variable" represents 
[illegible] mathematical sense (i.e., an elevation of 4000 feet is two times [illegible]). 

Landscape Variable                                      Source of Data                                            Variable Type 

LAND-USE SUBMODEL 
  1963 photo land use                                   1:20,000-scale B/W photos                                 Categorical 
  1970 photo land use                                   1:24,000-scale orthophotos                                Categorical 
  1972-1973 USGS photo land use                         1:100,000-scale USGS map                                  Categorical 
  1963 to 1970 land-use changes (from)                  1963 to 1970 land-use data planes                         Categorical 
  1963 to 1970 land-use changes (to)                    1963 to 1970 land-use data planes                         Categorical 
  1963 to 1970 alphanumeric land-use changes (from)     1963 to 1970 land-use data planes                         Categorical 
  1963 to 1970 alphanumeric land-use changes (to)       1963 to 1970 land-use data planes                         Categorical 

PHYSIOGRAPHIC SUBMODEL 
  Topographic elevation                                 1:24,000-scale USGS topographic maps                      Numerical 
  Topographic slope                                     Computed from elevations                                  Numerical 
  Topographic aspect                                    Computed from elevations                                  Numerical 
  Surficial geology                                     1:62,500-scale USGS map                                   Categorical 
  Landsat image insolation                              Computed from slope/aspect                                Numerical 
  Landsat MSS/insolation ratios                         Computed as MSS/insolation                                Numerical 

TRANSPORTATION SUBMODEL 
  Composite minor road minimum distance (MD)            Computed from 1:45,000-scale state highway map            Numerical 
  Composite major road MD                               Computed from 1:45,000-scale state highway map            Numerical 
  Freeway MD                                            Computed from 1:45,000-scale state highway map            Numerical 
  Freeway interchange MD                                Computed from 1:45,000-scale state highway map            Numerical 
  Built-up urban area MD                                Computed from 1:45,000-scale state highway map            Numerical 

SOCIOECONOMIC SUBMODEL 
  Total population                                      1970 census reports and 1:84,500-scale census tract map   Numerical 
  Total families                                        1970 census reports and 1:84,500-scale census tract map   Numerical 
  Total year-round housing units                        1970 census reports and 1:84,500-scale census tract map   Numerical 
  Total vacant housing units                            1970 census reports and 1:84,500-scale census tract map   Numerical 
  Total occupied housing units                          1970 census reports and 1:84,500-scale census tract map   Numerical 
  1969 mean family income                               1970 census reports and 1:84,500-scale census tract map   Numerical 
  Median housing-unit value                             1970 census reports and 1:84,500-scale census tract map   Numerical 
  Median housing-unit rent                              1970 census reports and 1:84,500-scale census tract map   Numerical 
  Total one-car families                                1970 census reports and 1:84,500-scale census tract map   Numerical 
  Total two-car families                                1970 census reports and 1:84,500-scale census tract map   Numerical 
  Total three-/three-plus car families                  1970 census reports and 1:84,500-scale census tract map   Numerical 
  Total census tract acreage                            Computed                                                  Numerical 
  Population density per acre                           Computed                                                  Numerical 
  Average number of cars per family                     Computed                                                  Numerical 
  Average number of families per acre                   Computed                                                  Numerical 
  Average number of year-round housing units per acre   Computed                                                  Numerical 
  Average number of vacant housing units per acre       Computed                                                  Numerical 
CONTOUR INTERVAL 200 FEET 
WITH SUPPLEMENTARY CONTOURS AT 100 FOOT INTERVALS 
TRANSVERSE MERCATOR PROJECTION 

FIGURE 4. TOPOGRAPHIC MAP OF THE DENVER STUDY AREA. [illegible] of 24 by 24 miles enclosing 
the Denver Metropolitan Area. 

As noted earlier, the simplest way of inputting each data plane into the landscape model was adopted. 
Generally, this required overlaying a dot pattern representing a selected cell size on the map (Reference 19). 
Topographic elevation data, for example, is an essential part of any landscape model (figure 5). The input 
value of each elevation cell was estimated from 1:24,000-scale topographic maps at the position of the sample 
dot for each of the 36,864 cells that constitute the elevation data plane. Very careful procedures were used 
to ensure that the common cell pattern overlaid by hand in this fashion on each new map or set of maps 
provided a data plane that registered exactly, cell-by-cell, on all existing data planes (figure 6). This hand 
tabulation of all the map-format data for analysis was very laborious but was more accurate for these initial 
research efforts in landscape modeling. When the basic principles of the approach are better understood, new 
landscape modeling efforts can input the data with a variety of more sophisticated map digitizing procedures. 
Many earlier studies of this nature dealt with more complex machine entry of map data into the computer and, 
as a result, expended less effort on its constructive analysis. 

A semitransparent multipurpose overlay data form (figure 7) was devised to combine both the coding and the 
keypunch operations. This 1:24,000-scale data form overlaid a quadrant of a township consisting of nine 
sections of land of 1.6- by 1.6-km (1- by 1-mi) squares, with 64 rectilinearly distributed sampling dots per 
section representing the 4-ha (10-acre) square resolution sampling unit. A 4-ha (10-acre) square cell was 
selected because it represented the smallest individual area that could be adequately sampled from the 
majority of the available 1:24,000-scale source maps. Data contained on each 1:24,000-scale map were 
cellularized by overlaying the data form on the map to match the boundaries of the appropriate township 
quadrant. The desired map information was next traced onto the overlay, and the category identification 
was recorded in each area (figure 8). This overlay form was the direct input to the keypunch operation, and 
no further transcription of the data was required. 
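
The geometry of the overlay data form can be illustrated with a short sketch. The names below are hypothetical and the layout is an assumption consistent with the description above: each form covers three by three sections, each section carries eight by eight sampling dots, and the forms tile the 192- by 192-cell study grid with no overlap. 

```python
DOTS_PER_SECTION = 8                                      # 8 x 8 dots = 64 ten-acre cells per section
SECTIONS_PER_FORM = 3                                     # each form overlays 3 x 3 sections
CELLS_PER_FORM = DOTS_PER_SECTION * SECTIONS_PER_FORM     # 24 cells along each side of a form
GRID_SIZE = 192                                           # cells along each side of the study grid
FORMS_PER_SIDE = GRID_SIZE // CELLS_PER_FORM              # 8 forms along each side


def global_cell(form_row, form_col, local_row, local_col):
    """Convert a dot position on one data form to its (row, col) in the full grid."""
    if not (0 <= form_row < FORMS_PER_SIDE and 0 <= form_col < FORMS_PER_SIDE):
        raise ValueError("form index lies outside the study area")
    if not (0 <= local_row < CELLS_PER_FORM and 0 <= local_col < CELLS_PER_FORM):
        raise ValueError("dot index lies outside the form")
    return (form_row * CELLS_PER_FORM + local_row,
            form_col * CELLS_PER_FORM + local_col)
```

Under these assumptions, eight forms span each side of the study area, so a complete data plane would require 64 forms of 576 dots each. 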

Compilation of the Data Plane 


Each horizontal row of dots on the semitransparent overlay form represented one punch card, and the 
keypuncher generated a deck of 24 cards directly from each 23-km² (9-mi²) area of 24 by 24 cell elements 
(figure 9). The effort of keypunching the data from these sheets was minimized by punching only the 
left-most identity of a land-use change at the proper position on the card representing its column. The cell 
code of identical row elements was identified with successive cells left blank until the left-hand boundary of 
a different land-use category was reached. This process facilitated keypunching and subsequent verification 
steps because fewer keystrokes were made, and the suppression of identical elements aided visual 
proofreading by comparing a line-printer display of the data deck with the original code form (figure 10a). 
Auxiliary computer programs were available to fill in the blank cells in a data deck (figure 10b), to assemble 
and display each total data plane from a group of data decks (figure 11), and to display a selected group 
of characteristics in a data plane (figure 12). 
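
The blank-filling step described above amounts to a left-to-right carry of the last punched code. The following is a minimal sketch of that idea only; the function name and the use of None for an unpunched column are assumptions, and the original auxiliary programs are not reproduced here. 

```python
def fill_blanks(card, blank=None):
    """Expand one sparsely punched row: each blank cell inherits the code to its left."""
    filled = []
    current = blank
    for code in card:
        if code != blank:
            current = code            # a new left-hand boundary starts here
        if current == blank:
            raise ValueError("a row must begin with a punched code")
        filled.append(current)
    return filled


def fill_deck(deck, blank=None):
    """Apply fill_blanks to every card (row) of a data deck."""
    return [fill_blanks(card, blank) for card in deck]
```

For example, a row punched as code 11, two blanks, code 21, one blank, and code 17 expands to 11, 11, 11, 21, 21, 17. 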

When each source map has been sampled and assembled, it constitutes a data plane of the landscape model. 
Each of these data planes can be selectively displayed as a computer microfilm graphic (graymap) to 
illustrate the spatial distribution of one or more categories or numeric ranges. As an example of these 
high-resolution computer graphics, a 1:250,000-scale graymap can illustrate all first-order urban and built-up 
lands contained in the 1970 land-use data plane (figure 13). A second microfilm of the same data plane 
emphasizes all first-order agricultural land (figure 14). 
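
The selective display of a numeric range can be sketched as a three-level thresholding of a data plane. This is an illustration only, with hypothetical names and single-character shades standing in for the microfilm gray levels; it is not the original graymap software. 

```python
def graymap(plane, low, high):
    """Shade each numeric cell: 'B' (black) at or below low, 'W' (white) at or above high, else 'G' (gray)."""
    def shade(value):
        if value <= low:
            return "B"
        if value >= high:
            return "W"
        return "G"
    return ["".join(shade(value) for value in row) for row in plane]
```

Applied to an elevation plane with break points of 5,110 and 5,780 feet, such a routine would render the lowest areas in black, intermediate elevations in gray, and the highest areas in white. 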
FIGURE 6. GEOGRAPHIC REFERENCE OVERLAY FOR DATA FORMS. The exact location and internal grid row and 
column identifiers for the data coding forms (figures 7 and 8) are shown relative to the USGS topographic map boundaries 
(figure 5). Four data coding forms are used for the four quadrants of each township. 
FIGURE 7. OVERLAY DATA FORM USED FOR SAMPLING A 1:24,000-SCALE MAP. The nine 
large squares in a three-by-three array overlay individual sections or square miles of land on the 
appropriate 1:24,000-scale map. The eight-by-eight array of dots in each of these nine sections represents a 
sampling of each 10 acres, or 1/64 square mile per cell. The original form is printed on transparent 
paper and can be drawn on while overlaying a particular source map. The form shown here has been 
reduced to a scale of about two-thirds of its original 1:24,000 scale. 
FIGURE 9. A BLOCK OF PUNCHED CARDS REPRESENTING ONE OVERLAY DATA FORM. Reading from front to 
back, these 24 cards represent exactly what is keypunched from the completed data form shown in figure 8. Note that only 
a minimal amount of keypunching and visual verification is needed to represent and check the map boundaries. Simple 
computer routines expand these data to fill every landscape cell. 


CONSTRUCTION OF LAND-USE SUBMODEL 

The 1970 land-use data already illustrated was obtained from 1:24,000-scale orthophotomaps 
photointerpreted for thematic land use with the remote sensing-based classification scheme of Anderson, Hardy, 
and Roach (Reference 10), transferred to the semitransparent overlay data forms, and dot-sampled and assembled 
as described. When an unrepresentative classification was made on the basis of the dot designation alone in 
areas of finely mixed land uses, the dominant land use was assigned with priority to the lower-valued 
categorical urban codes (versus the higher-valued categorical rural codes). 

Selective display graymaps of this data plane show that recent developments in land use, such as clustered 
development, occurred on the periphery of existing urban lands and placed new pressures on adjoining 
open-space lands by providing nodes or springboards for additional urbanization (figures 15 and 16). 
The 1963 land-use data reduction efforts were structured to minimize coding and keypunching duplication 
of the completed 1970 land-use data plane. This was accomplished by comparatively interpreting individual, 
nonstereo 1963 aerial photos to the 1970 orthophotomaps for land-use changes. Only the areas of change 
were coded and keypunched into a 1963 land-use change plane, on which the areas of change were punched 
as solid geometric blocks and unchanged areas were left blank. The resultant change data plane was logically 
merged with the 1970 land-use data plane to generate a complete 1963 land-use data plane (figures 17 and 18). 
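
The logical merge can be sketched as a cell-by-cell selection: where the change plane is blank, the 1970 code is carried back unchanged; elsewhere the punched 1963 code is used. The names below are hypothetical, and the use of None to mark a blank cell is an assumption. 

```python
NO_CHANGE = None   # assumed marker for cells left blank on the change plane


def reconstruct_earlier_plane(later_plane, change_plane, blank=NO_CHANGE):
    """Merge a later land-use plane with a change plane to rebuild the earlier plane."""
    if len(later_plane) != len(change_plane):
        raise ValueError("planes must have the same number of rows")
    merged = []
    for later_row, change_row in zip(later_plane, change_plane):
        if len(later_row) != len(change_row):
            raise ValueError("planes must have the same number of columns")
        merged.append([later if change == blank else change
                       for later, change in zip(later_row, change_row)])
    return merged
```

Because only the changed areas are coded, the amount of keypunching scales with the area of change rather than with the full 36,864 cells. 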

The USGS land-use map of Denver (figure 19) was also manually sampled from a 1:100,000-scale land-use 
classification map (figure 20) encoded in the USGS Circular 671 system (Reference 11). This USGS map was 
compiled from 1:121,000-scale, high-altitude NASA U-2 color infrared aerial photos of 1972-1973. A 
90-percent minimum accuracy value for this photointerpreted land use was claimed for this product although no 
verification procedures were described. A 1:100,000-scale mylar copy of this map was obtained from the 
USGS Public Inquiries Office in Denver. It was run through an ozalid blueprint machine in contact with a 
matching dot sampling grid on mylar to produce a dot-overprinted paper map for the keypunching operation. 
Both map and reduced transparent grid were carefully registered before printing. Six major first-order 
categories were defined, with 21 second- and third-order land uses defined in the Denver study area. 

CONSTRUCTION OF PHYSIOGRAPHIC SUBMODEL 

Topographic elevation data was manually sampled from sixteen 1:24,000-scale USGS 7.5-minute quadrangle 
sheets (figure 5) with the data overlay form (figure 7). The elevation of each cell was estimated to the nearest 
3 meters (10 feet) and noted next to the appropriate sample dot. The 64 sheets of elevation values were 
punched and assembled to provide a completely filled topographic elevation data plane (figure 21). 

This basic elevation data plane was input to separate computer programs to generate additional "derived" 
topographic slope, topographic aspect, and image insolation (solar radiation) data planes. The TOPOMAP 
program (References 16 and 17) computed both slope and aspect from the uniform grid of elevation points. 
This was accomplished by fitting a regression surface to three-by-three arrays of cells of the elevation data 
plane and assigning the slope and aspect to the center cell of each three-by-three array (Reference 20). The 
repetition of this process cell by cell over the entire elevation data plane yielded the slope (figures 22 and 
23) and aspect (figure 24) data planes. 
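
The three-by-three regression step can be illustrated with a minimal sketch. It assumes that the regression surface is a least-squares plane, that each cell is 660 feet on a side (a 10-acre square), and that aspect is reported as the downslope direction in degrees clockwise from north. The form of the surface actually fitted by TOPOMAP is not stated here, so the code should be read as one plausible instance of the technique rather than as the original program. 

```python
import math

CELL_FT = 660.0   # assumed side length of a 10-acre square cell, in feet


def slope_aspect(window, cell=CELL_FT):
    """Fit a least-squares plane to a 3 x 3 elevation window.

    window[0] is the northern row and column 0 is the western column.
    Returns (slope in percent, aspect in degrees clockwise from north),
    with aspect None for a perfectly flat window.
    """
    west = sum(row[0] for row in window)
    east = sum(row[2] for row in window)
    north = sum(window[0])
    south = sum(window[2])
    # For a regular 3 x 3 grid, the least-squares plane gradient reduces to
    # differences of column sums and row sums divided by 6 cell widths.
    dz_east = (east - west) / (6.0 * cell)
    dz_north = (north - south) / (6.0 * cell)
    slope_pct = 100.0 * math.hypot(dz_east, dz_north)
    if slope_pct == 0.0:
        return 0.0, None
    # The surface faces the downslope direction, opposite to the gradient.
    aspect = math.degrees(math.atan2(-dz_east, -dz_north)) % 360.0
    return slope_pct, aspect
```

Moving this window cell by cell over the elevation data plane, and storing the two results at the center cell each time, would yield slope and aspect planes registered to the elevation plane. Edge cells, which lack a full three-by-three neighborhood, would need separate treatment. 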

Three-dimensional perspective graphics provide familiar oblique viewing angles and convey the spatial 
concepts and relationships represented by the topographic data planes (figure 25). Additional computer 
graphics permit the preparation of vertical contouring (figure 26) and perspective contouring (figure 27) 
displays of the topographic elevation data. Physiographic relationships of landscape components to 
topography may be visualized by perspective graymapping of a second data plane; for example, all 1970 urban 
and built-up land (figure 13) on the oblique representation of the topographic data plane (figure 28). 


Potential incoming solar radiation (insolation) was computed by program INSOL2 (References 21 and 22) 
for the Landsat-1 image to be subsequently introduced. This cell-by-cell computation was completed as a 
function of time and date (solar altitude and azimuth), location (latitude and longitude), and topographic 
elevation for ancillary data cells with a computed slope and aspect. This computation yielded a measure 
of both direct and indirect potential insolation for each landscape cell. The calculated flat-surface insolation 
was ratioed by an actual flat-surface measurement from the Pawnee National Grasslands just northeast of 
Display scale 1:250,000 

BLACK = Single- and multiple-unit residential dwellings (code 11) 
GRAY  = Open and other urban land (code 19) and cropland and pasture agricultural land (code 21) 
WHITE = All other land uses 

FIGURE 17. DISPLAY OF THE 1963 LAND-USE DATA PLANE EMPHASIZING SINGLE- AND 
MULTIPLE-UNIT RESIDENTIAL AREAS (in black). The microfilm display of each 10-acre square cell 
maps the 24- by 24-mile Denver Metropolitan Area. 

BLACK = Strip and clustered development (code 17) 
GRAY  = Urban and built-up areas (codes 11, 12, 13, 14, 15, and 16) 
WHITE = All other land uses 

FIGURE 18. DISPLAY OF THE 1963 LAND-USE DATA PLANE EMPHASIZING THE LOCATION OF 
STRIP AND CLUSTERED DEVELOPMENT (in black) RELATIVE TO OTHER URBAN AND BUILT-UP 
AREAS (in gray). The microfilm display of each 10-acre square cell maps the 24- by 24-mile Denver 
Metropolitan Area. 

Display scale [illegible] 

FIGURE 19. DISPLAY OF THE ORIGINAL 1972-1973 USGS 1:100,000-SCALE LAND-USE SOURCE MAP. 
This map is presented for comparison with the cellular representation of this source map as a data plane in 
figure 20. It was compiled from NASA U-2 color infrared aerial photos taken at a scale of [illegible]. 
The [illegible] classification scheme was employed. A 90-percent minimum accuracy for manual 
photointerpretation was claimed (Reference 11). 

Display scale 1:250,000 

BLACK = Single- and multiple-unit residential dwellings (code 11) 
GRAY  = Open and other urban land uses (code 19), nonirrigated cropland (code 211), 
        irrigated cropland (code 212), pasture (code 213), grass (code 31), and 
        chaparral (code 33) 
WHITE = All other land uses 

FIGURE 20. DISPLAY OF THE 1972-1973 USGS LAND-USE DATA PLANE EMPHASIZING SINGLE- 
AND MULTIPLE-UNIT RESIDENTIAL AREAS (in black). The microfilm display of each 10-acre square 
cell maps the 24- by 24-mile Denver Metropolitan Area shown in the original map in figure 19. 

Display scale 1:250,000 

BLACK = Lowest elevations (5,110 feet and below) 
GRAY  = Intermediate elevations (5,110 to 5,780 feet) 
WHITE = Highest elevations (5,780 feet and above) 

FIGURE 21. DISPLAY OF THE TOPOGRAPHIC ELEVATION DATA PLANE EMPHASIZING THE 
LOWEST AREAS (in black). Elevation data was manually sampled from sixteen 1:24,000-scale USGS 
quadrangle sheets for 36,864 individual 10-acre cells with a 10-foot vertical resolution. The microfilm display 
of each 10-acre square cell maps the 24- by 24-mile Denver Metropolitan Area. 

Display scale 1:250,000 

BLACK = Flat areas (0.4 percent and less) 
GRAY  = Intermediate slope areas (0.4 to 5.8 percent) 
WHITE = Higher slope areas (5.8 percent and greater) 

FIGURE 22. DISPLAY OF THE TOPOGRAPHIC SLOPE DATA PLANE EMPHASIZING SHALLOW 
SLOPES (in black). Hand-coded elevations displayed in figure 21 were used to compute topographic slope 
for each 10-acre cell. The microfilm display of each 10-acre square cell maps the 24- by 24-mile Denver 
Metropolitan Area. 

Display scale 1:250,000 

BLACK = Higher slope areas (5.8 percent and greater) 
GRAY  = Intermediate slope areas (0.4 to 5.8 percent) 
WHITE = Flat areas (0.4 percent and less) 

FIGURE 23. DISPLAY OF THE TOPOGRAPHIC SLOPE DATA PLANE EMPHASIZING STEEP SLOPES 
(in black). The graphic display values of figure 22 were simply reversed to emphasize different slope 
components. The microfilm display of each 10-acre square cell maps the 24- by 24-mile Denver 
Metropolitan Area. 

Display scale 1:250,000 

BLACK = Northwest-facing areas 
GRAY  = Intermediate aspects 
WHITE = Southeast-facing areas 

FIGURE 24. DISPLAY OF THE TOPOGRAPHIC ASPECT DATA PLANE EMPHASIZING NORTHWEST- 
FACING AREAS (in black). Hand-coded elevations displayed in figure 21 were used to compute topographic 
aspect for each 10-acre cell. The reader should interpret the display as illuminated from the lower right-hand 
corner (i.e., from the southeast) so that the terrain shadows fall away from the observer. The microfilm display 
of each 10-acre square cell maps the 24- by 24-mile Denver Metropolitan Area. 

FIGURE 25. THREE-DIMENSIONAL PERSPECTIVE DISPLAY OF THE TOPOGRAPHIC ELEVATION 
DATA PLANE LOOKING FROM THE SOUTHEAST. The data input consists of the elevation data plane. 
The user defines the line of sight by specifying the viewing point and the point looked at. In the resulting 
perspective plot the hidden lines are removed. The 24- by 24-mile Denver Metropolitan Area is mapped by 
alternate rows and columns of 10-acre square cells. 

Fort Collins, Colorado, at the time of the August 15, 1973, Landsat-1 overflight. This introduced an 
estimate of the atmospheric attenuation in the area at the time of the Landsat overflight. This attenuation 
factor was applied to derive final ground insolation estimates for each 4-ha (10-acre) ground data cell 
(figure 29). These collateral insolation data will be evaluated to determine if they can be used during 
machine land-use classification efforts in topographically shaded areas where the lack of uniform 
illumination altered the spectroirradiance measured from otherwise identical land-cover types and/or surface 
materials. 
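
The attenuation adjustment can be sketched as a simple scaling. The direction of the ratio (measured flat-surface value divided by calculated flat-surface value) and the function names are assumptions made for illustration; the numerical values in the usage note are hypothetical and are not measurements from this study. 

```python
def attenuation_factor(calculated_flat, measured_flat):
    """Estimate atmospheric attenuation as measured over calculated flat-surface insolation."""
    if calculated_flat <= 0:
        raise ValueError("calculated flat-surface insolation must be positive")
    return measured_flat / calculated_flat


def ground_insolation(potential_values, factor):
    """Scale potential insolation values for a row of cells by the attenuation factor."""
    return [value * factor for value in potential_values]
```

For instance, a hypothetical calculated flat-surface value of 150 units against a measured value of 120 units would give a factor of 0.8, which would then be applied uniformly to the potential insolation computed for every landscape cell. 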

Generalized surficial geologic data was manually sampled from a USGS map prepared in cooperation with 
the Denver Board of Water Commissioners and the Colorado Water Conservation Board (Reference 23). 
This 1:62,500-scale map was sampled with a reduced version of the original dot grid. Twelve categorical 
geologic classes were originally defined on the map. Subsequently, a thirteenth class was added, consisting 
of the prominent water bodies shown on the map (figure 30). 
Display scale 1:250,000 

FIGURE 26. CONTOUR MAP OF THE TOPOGRAPHIC ELEVATION DATA PLANE. The 5400, 6200, 
and 7000 foot elevation contour lines are shown. These 800-foot elevation contours of figure 21 are viewed 
in the conventional vertical map format. The 24- by 24-mile Denver Metropolitan Area is mapped by alternate 
rows and columns of 10-acre square cells. 

(a) Terrain contour display with surface grid 
(b) Terrain contour display without surface grid 

FIGURE 27. PERSPECTIVE DISPLAY OF THE TOPOGRAPHIC ELEVATION DATA PLANE. 
[Remainder of caption illegible.] 



CONSTRUCTION OF TRANSPORTATION SUBMODEL 

A 1971 road-classification map for the Denver Metropolitan Area was obtained from the State Highway 
[illegible] expressways, principals, majors, minors, collectors, and isolated minor roads. This 1:45,000-scale 
map provided the point data plane for the computation of minimum distance planes and was sampled with a 
reduced scale, point-sampling grid. The linear road classes were [illegible] nearest-neighbor grid point to 
provide a continuous representation in cellular form. When multiple road classes intersected or existed in the 
nearest-neighbor vicinity of a dot point, the decision was made to choose the highest capacity road class. 

These point and linear transportation features were transformed into area planes and overlaid on the 
landscape model. The locations of freeway interchanges are an example of a typical point feature that has an 
[illegible] function of the distance away from the interchange and was best handled in the landscape model in 
terms of a data plane representing minimum distances. The freeway interchanges were initially tabulated into a 
point-type data plane that recorded their location in the nearest 4-ha (10-acre) cell (figure 31). This initial 
FIGURE 28. THREE-DIMENSIONAL PERSPECTIVE DISPLAY OF THE TOPOGRAPHIC ELEVATION 
DATA PLANE EMPHASIZING ALL URBAN BUILT-UP LANDS (in black). The display system generated 
this plot by first defining the terrain as a statistical surface on which the urban land uses (codes 11 through 
19, inclusively) were then overlaid as graymapped cells. Like 3-D contouring (figure 27), perspective 
graymapping is a useful graphic tool for visualizing the numeric and/or spatial relationships of landscape 
variables. The 24- by 24-mile Denver Metropolitan Area is mapped by alternate rows and columns of 10-acre 
square cells. 

data plane was subjected to numeric computation so that the minimum distance in an east-west and/or 
north-south sense was computed for each cell in the data plane to the nearest cell occupied by a freeway 
interchange (Reference 24). 


The minimum distance computed in this fashion was recorded at the position of the selected cell, and the 
computation was completed for each cell in the data plane. This transformed the original point plane into a 
useful minimum-distance area plane, which was overlaid onto the landscape model (figure 32). Similarly, the 
initial data planes representing linear road features (figures 33a, 33c, and 34a) were computationally converted 
to area planes for overlaying on the landscape model (figures 33b, 33d, and 34b). An initial urban built-up 
area classification was established in recognition of the area nature of the fully developed street network and 
the established land-use patterns found in extensively developed urban areas (figure 34c). Again, an urban 
built-up area minimum-distance plane was numerically derived from this source data plane (figure 34d). 
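
The conversion of a point or line plane into a minimum-distance area plane can be sketched as follows. The sketch interprets distance "in an east-west and/or north-south sense" as a city-block (rectilinear) distance and computes it with a standard two-pass sweep; both the interpretation and the algorithm are assumptions for illustration, since the procedure of Reference 24 is not described here. The 660-foot cell size corresponds to a 10-acre square. 

```python
def min_distance_plane(feature, cell_ft=660):
    """Return city-block distance, in feet, from each cell to the nearest feature cell.

    feature is a rectangular list of rows; truthy entries mark cells occupied
    by the feature of interest (e.g., a freeway interchange).
    """
    rows, cols = len(feature), len(feature[0])
    unreached = rows + cols              # exceeds any possible city-block distance
    dist = [[0 if feature[r][c] else unreached for c in range(cols)]
            for r in range(rows)]
    if all(d != 0 for row in dist for d in row):
        raise ValueError("the source plane contains no feature cells")
    # Forward sweep: propagate distances from the north and west neighbors.
    for r in range(rows):
        for c in range(cols):
            if r > 0:
                dist[r][c] = min(dist[r][c], dist[r - 1][c] + 1)
            if c > 0:
                dist[r][c] = min(dist[r][c], dist[r][c - 1] + 1)
    # Backward sweep: propagate distances from the south and east neighbors.
    for r in range(rows - 1, -1, -1):
        for c in range(cols - 1, -1, -1):
            if r < rows - 1:
                dist[r][c] = min(dist[r][c], dist[r + 1][c] + 1)
            if c < cols - 1:
                dist[r][c] = min(dist[r][c], dist[r][c + 1] + 1)
    return [[d * cell_ft for d in row] for row in dist]
```

Under this interpretation, a cell that is five rectilinear steps from the nearest interchange receives a value of 3,300 feet, and cells occupied by the feature itself receive zero. 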
BLACK = Shadowed northwest-facing areas ([illegible] centilangleys and less) 
GRAY  = Intermediate values (113.2 to 121.4 centilangleys) 
WHITE = Fully sunlit southeast-facing areas (121.4 centilangleys and greater) 

FIGURE 29. DISPLAY OF THE INSOLATION DATA PLANE EMPHASIZING AREAS OF LOWEST 
INCOMING SOLAR ENERGY (in black). Near-instantaneous solar radiation for the overflight time of 
the August 15, 1973, Landsat-1 image was generated from the computed slope and aspect for each 10-acre 
landscape cell. The microfilm display of each 10-acre square cell maps the 24- by 24-mile Denver 
Metropolitan Area. 

Display scale 1:250,000 

BLACK = Wind-deposited silt, sand, and cobbles on foothill slopes (Holocene [illegible]) 
DARKEST GRAY = Sand, gravel, silt, and clay (Pleistocene and Holocene period [illegible] 
        Broadway, and Louviers Alluviums) 

FIGURE 30. DISPLAY OF THE SURFICIAL GEOLOGIC DATA PLANE EMPHASIZING AEOLIAN 
DEPOSITS (in black). Thirteen generalized surficial geologic units were manually sampled from a 
1:62,500-scale USGS map. The microfilm display of each 10-acre square cell maps the 24- by 24-mile 
Denver Metropolitan Area. 

Display scale 1:250,000 

FIGURE 31. DISPLAY OF THE 1971 ROAD TRANSPORTATION DATA PLANE SHOWING FREEWAY 
INTERCHANGES. Eight other road access categories were also manually sampled from a 1:45,000-scale state 
metropolitan road classification map. Some planners have speculated that freeway development is a powerful 
inducement to land-use conversions. This hypothesis was statistically tested through the spatial landscape 
modeling process. The microfilm display shows each 10-acre cell containing a freeway interchange in the 24- 
by 24-mile Denver Metropolitan Area. 

Display scale 1:250,000 

BLACK = Minimum distances to freeway interchanges (3,300 feet or less) 
GRAYS = Intermediate to furthest distances to freeway interchanges (five 3,300-foot 
        intervals from 3,300 to 16,500 feet or more) 
WHITE = Freeway interchanges 

FIGURE 32. DISPLAY OF THE MINIMUM DISTANCE TO FREEWAY INTERCHANGE DATA PLANE 
EMPHASIZING MINIMUM DISTANCE (in black). Freeway interchange access data (figure 31) were used to 
compute minimum distance in a north-south and/or east-west traverse from each cell to the nearest freeway 
interchange. The microfilm display of each 10-acre square cell maps the 24- by 24-mile Denver Metropolitan 
Area. 
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Display seal* 1:600,000 


BLACK * Miirimiim distanea to road typa of intarait (3,300 faat or lass) 

QRAY3 • intarnwdiata to furthast dlstancas (3,300 fMt or graatar) 

WHITE • Road typa of iittarast 

FIGURE 33. DISPLAY OF THE 1971 FREEWAY AND MAJOR-ROAD TRANSPORTATION DATA 
PLANES AND THEIR ASSOCIATED TRANSFORMED PLANES THAT EMPHASIZE MINIMUM DIS- 
TANCES (In black). Freeways (a) and major roads (c) were manually sampled as linear features from a 
1 :46, 000-scale road-classification map. These data were used to compute minimum diste-'sn ((b) and (d)j 
from each cell in the plane in a norOl-south and east-west traverse to the nearest cell coiiu. .ling the given 
road type. The microfilm display of each 10-acre cell maps the 24- by 24-mile Denver Metropolitan Area. 
(c) Built-Up Urban Areas          (d) Minimum Distance to Built-Up Urban Areas

Display scale 1:500,000

BLACK = Minimum distance to road type of interest (3,300 feet or less)

GRAYS = Intermediate to furthest distances (3,300 feet or greater)

WHITE = Road/area class of interest


FIGURE 34. DISPLAY OF THE 1971 MINOR-ROAD AND BUILT-UP URBAN AREA TRANSPORTATION
DATA PLANES AND THEIR ASSOCIATED TRANSFORMED PLANES THAT EMPHASIZE MINIMUM
DISTANCE (in black). These road-access data ((a) and (c)) were used to compute minimum distance ((b)
and (d)) from each cell in the plane in a north-south and east-west traverse to the nearest cell containing the
given road class. The microfilm display of each 10-acre square cell maps the 24- by 24-mile Denver
Metropolitan Area.
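
The minimum-distance transformation described in the captions for figures 33 and 34 can be sketched as follows. This is only an illustration, not the original program: it assumes that a "north-south and east-west traverse" corresponds to a city-block (rectilinear) distance counted in cells, and it uses the fact that a square 10-acre cell is 660 feet on a side, so that the 3,300-foot class width equals five cells. The function name `city_block_distance` and the small test grid are hypothetical.

```python
import numpy as np

CELL_FT = 660  # side of a square 10-acre cell (435,600 sq ft), in feet


def city_block_distance(feature):
    """Distance in cells from every cell to the nearest True cell of
    `feature`, moving only north-south and east-west (two-pass sweep)."""
    rows, cols = feature.shape
    big = rows + cols  # exceeds any possible rectilinear path on the grid
    dist = np.where(feature, 0, big).astype(int)
    # Forward pass: carry distances in from the north and west neighbors.
    for r in range(rows):
        for c in range(cols):
            if r > 0:
                dist[r, c] = min(dist[r, c], dist[r - 1, c] + 1)
            if c > 0:
                dist[r, c] = min(dist[r, c], dist[r, c - 1] + 1)
    # Backward pass: carry distances in from the south and east neighbors.
    for r in range(rows - 1, -1, -1):
        for c in range(cols - 1, -1, -1):
            if r < rows - 1:
                dist[r, c] = min(dist[r, c], dist[r + 1, c] + 1)
            if c < cols - 1:
                dist[r, c] = min(dist[r, c], dist[r, c + 1] + 1)
    return dist


# Hypothetical 5 x 5 plane with two cells containing the road type of interest.
feature = np.zeros((5, 5), dtype=bool)
feature[0, 0] = True
feature[4, 3] = True

dist_cells = city_block_distance(feature)
dist_feet = dist_cells * CELL_FT
```

Grouping `dist_feet` into 3,300-foot classes would then give the black and gray levels of the displays.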
CONSTRUCTION OF SOCIOECONOMIC SUBMODEL


Socioeconomic data for the United States are tabulated by census at 10-year intervals. These data are
reported in a tabular form referenced to maps of the census tracts or smaller tabulation units called
enumeration districts. These census-tract reference maps (figure 35) were sampled by a dot pattern so that each
4-ha (10-acre) cell was assigned to a specific census tract on the resulting data plane. This procedure
permitted the tabular statistics to be input in a list format and projected into area-type, socioeconomic data
planes of population, housing, income, car ownership, and census-tract acreage as follows:


Socioeconomic Variable                                  Intermediate Processing

Total population                                        None
Total families                                          None
Total year-round housing units                          None
Total vacant housing units                              None
Total occupied housing units                            None
1969 mean family income                                 None
Median housing-unit value                               None
Median housing-unit rent                                None
Total one-car families                                  None
Total two-car families                                  None
Total three-/three-plus-car families                    None
Total census tract acreage                              None
Population density per acre                             Normalization by census tract acreage
Average number of families per acre                     Normalization by census tract acreage
Average number of year-round housing units per acre     Normalization by census tract acreage
Average number of vacant housing units per acre         Normalization by census tract acreage
Average number of cars per family                       Average value computed

Unfortunately, the census tracts are far coarser than the 4-ha (10-acre) resolution of the landscape
model. Consequently, these socioeconomic data planes are not as highly resolved spatially as desired,
but they provide a reasonable approximation of the spatial variation of these census variables.
FIGURE 35. CENSUS TRACT OVERLAY OF THE DENVER METROPOLITAN AREA. The boundaries of each of
these census tracts as defined for the Denver Metropolitan
statistical area for 1970 were overlaid onto the 10-acre
cellular network. Socioeconomic census data were
introduced into the landscape model in this fashion.











Seventeen socioeconomic data planes were generated for the 241 census tracts in the Denver study area.
Total number of families (figure 36), total year-round housing units, total vacant housing
units, total occupied housing units, 1969 mean family income (figure 37a), median housing-unit value
(figure 37b), and median housing-unit rent (figure 37c) were directly projected into the landscape model.
Census-tract acreage was computed from manual dot counts and used to normalize the population, family,
and housing totals. Thus, a preliminary data plane of total population (figure 38a) for each census tract
was divided by that tract's acreage, and the resulting population density per acre (figure 38b) was projected
into all of that tract's cells, yielding a meaningful population density data plane. Similarly, the initial data
planes representing the average number of families, year-round housing units, and vacant housing units per
acre were computed and projected into density-type, socioeconomic census data planes and were overlaid
onto the landscape model.
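
The projection of tabular census statistics into cell-level data planes, and the normalization by dot-counted tract acreage, can be illustrated with a short sketch. The tract-identifier plane and the population figures below are hypothetical; only the 10-acre cell size and the divide-then-project sequence come from the text.

```python
import numpy as np

ACRES_PER_CELL = 10  # each dot-grid cell represents 10 acres

# Hypothetical tract-ID data plane: each cell holds the index of the
# census tract that the sampling dot fell in.
tract_plane = np.array([[0, 0, 1],
                        [0, 1, 1],
                        [2, 2, 1]])

# Hypothetical tabular statistic in list format, one value per tract.
population = np.array([300.0, 800.0, 50.0])

# Tract acreage from dot counts: number of cells per tract times 10 acres.
cells_per_tract = np.bincount(tract_plane.ravel(), minlength=population.size)
acreage = cells_per_tract * ACRES_PER_CELL

# Area-type plane: every cell receives its tract's total.
population_plane = population[tract_plane]

# Density-type plane: divide by tract acreage, then project into the cells.
density_plane = (population / acreage)[tract_plane]
```

Indexing the per-tract list with the tract-ID plane is what assigns one tract value to every cell of that tract, which is also why the resulting planes are no finer than the tracts themselves.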

One- (figure 39a), two- (figure 39b), three-, and three-plus-car family totals (figure 39c) per census tract were
also transcribed into initial data planes, summed, and divided by the total-number-of-families data plane on a
cell-by-cell basis to derive a data plane representing the average number of cars per family (figure 39d).

SUMMARY 

Construction of a landscape model for the Denver Metropolitan Area involved 4-ha (10-acre) dot-grid sampling
of selected land-use, physiographic, socioeconomic, and transportation map variables to generate a total of 34
meaningful data planes. These map data planes were spatially registered to each other so that they could be
"stacked" or overlaid for subsequent computer analysis, modeling, and spatial/temporal prediction (figure 3).

The selection of collateral map data for inclusion in the landscape model was fundamental to the success of
this effort. The data employed were extensive, dispersed, and complex, and were obtained from many widely
varying sources. The variables ultimately chosen were selected for their suitability and availability.

These map variables were substantial in number and exhibited considerable diversity in the broad realms of
cultural and physical landscape components. However, they were only a sample of all possible variables
pertinent to landscape modeling. Such aspects as utilities service, domestic water supply, land ownership and
parcel size, land values, soils, and zoning regulations were among the many variables considered, but not
included in this study (Reference 1). If, by some happenstance, one or more of these data were available for
inclusion, other data equally important to the study of land-use practices in the Denver Metropolitan Area
would still be unavailable.

Because there were no immediate limitations to enlarging the set of variables, the underlying understanding of
land-use practices constituted a limitation in the determination of which mapping variables should be included
in the data set to fully inform the landscape analysis effort. Even if this considerable problem were resolved,
the most significant remaining limitation is the simple lack of data or "lack" of data in any usable form. This
is particularly true in developing countries (Reference 25), but was no less the case in regional studies such as
this, in which extensive spatial data are required and generally believed to be readily available.

The simplest way of inputting each data plane into the landscape model was adopted. This ensured that the
majority of the resources and effort would be channeled into analytical endeavors. The thrust of earlier studies
involved more complex map data entry and computer data structures, resulting in curtailed analysis efforts.
Display scale 1:250,000


BLACK = Maximum total families per census tract (2772 to 3080 families)
GRAYS = Minimum to intermediate total families per census tract (7 to 2772
families)


FIGURE 36. DISPLAY OF THE 1970 SOCIOECONOMIC DATA EMPHASIZING THE HIGHEST TOTAL
NUMBER OF FAMILIES PER CENSUS TRACT (in black). Eleven other socioeconomic parameters were
also manually sampled from 1:84,500-scale census maps. The microfilm display of each 10-acre square cell
maps the 24- by 24-mile Denver Metropolitan Area.
Display scale 1:500,000

(a) 1969 Mean Family Income Per Census Tract

BLACK = Maximum mean family income per census tract ($31,890 to $36,433 per family)
GRAYS = Minimum to intermediate mean family income per census tract ($4,399 to $31,890 per family)

(b) Median Housing-Unit Value Per Census Tract

BLACK = Maximum median housing-unit value per census tract ($45,000 to $60,000 per housing unit)
GRAYS = Minimum to intermediate median housing-unit value ($7,700 to $45,000 per housing unit)

(c) Median Housing-Unit Rent Per Census Tract

BLACK = Maximum median housing-unit rent per census tract ($270 to $300 per month)
GRAYS = Minimum to intermediate median housing-unit rent per census tract ($49 to $270 per month)

FIGURE 37. DISPLAY OF THE 1970 SOCIOECONOMIC DATA EMPHASIZING HIGHEST 1969 MEAN FAMILY
INCOME, MEDIAN HOUSING-UNIT VALUE, AND MEDIAN HOUSING-UNIT RENT (in black). These three
socioeconomic summary statistics were manually sampled from 1:84,500-scale census maps. The microfilm
display of each 10-acre square cell maps the 24- by 24-mile Denver Metropolitan Area.
Display scale 1:500,000

(a) Total Population Per Census Tract

BLACK = Maximum total population per census tract ([illegible] persons)
GRAYS = Minimum to intermediate total population per census tract (39 to [illegible] persons)

(b) Population Density Per Census-Tract Acre

BLACK = Maximum population density per census-tract acre ([illegible] persons per acre)
GRAYS = Minimum to intermediate population densities per census-tract acre ([illegible] persons per acre)

FIGURE 38. DISPLAY OF THE 1970 SOCIOECONOMIC DATA EMPHASIZING HIGHEST POPULATION TOTALS AND
AVERAGES (in black). Total population (a) was manually sampled from 1:84,500-scale census maps. These totals were
divided by census-tract acreages to compute population density (b). The microfilm display of each 10-acre square cell maps
the 24- by 24-mile Denver Metropolitan Area.
Display scale 1:500,000

(a) Total One-Car Families Per Census Tract

BLACK = Maximum total one-car families per census tract (1736 to 1928 families)
GRAYS = Minimum to intermediate total one-car families per census tract (6 to 1735 families)

(b) Total Two-Car Families Per Census Tract

BLACK = Maximum total two-car families per census tract (1584 to 1760 families)
GRAYS = Minimum to intermediate total two-car families per census tract (11 to 1584 families)

(c) Total Three- and Three-Plus-Car Families Per Census Tract

BLACK = Maximum total three- and three-plus-car families per census tract (425 to 472 families)
GRAYS = Minimum to intermediate total three- and three-plus-car families per census tract (5 to 425 families)

(d) Average Number of Cars Per Family

BLACK = Maximum average number of cars per family (2.8 to 3.2 cars per family)
GRAYS = Minimum to intermediate average number of cars per family (0.7 to 2.8 cars per family)

FIGURE 39. DISPLAY OF THE 1970 SOCIOECONOMIC DATA EMPHASIZING HIGHEST ONE-, TWO-, THREE-,
AND THREE-PLUS-CAR FAMILY OWNERSHIP AND AVERAGE NUMBER OF CARS PER FAMILY (in black).
Car-ownership totals ((a) through (c)) were manually sampled from 1:84,500-scale census maps. The average number
of cars per family (d) was computed from the one-, two-, three-, and three-plus-car family data
planes. The microfilm display of each 10-acre square cell maps the 24- by 24-mile Denver Metropolitan Area.





Collateral data for the landscape model consisted of measurements on 34 map variables for 36,864
observations. These observations were point locations selected by a systematic 4-ha (10-acre) dot-grid sample
... A 1972-1973 USGS land-use map was also dot sampled and ...

... submodel was encoded from 16 USGS quadrangle ... insolation data planes were computed with auxiliary
computer programs. Surficial geologic data were also added to the landscape model from a USGS map.

... classes was the basic input to the transportation submodel. The five original point data planes
representing composite minor roads, composite major roads, freeways, freeway interchanges, and urban
built-up areas were transformed into minimum-distance area planes by numeric computation to be overlaid
in the landscape model.

The tabular socioeconomic data and maps of the 1970 Census were used to create twelve area-type ...
car ownership, and census-tract acreage. Population density per acre, average number of families per acre,
average number of year-round/vacant housing units per acre, and average number of cars per family were
computed and projected into five density-type data planes.
CHAPTER 3


SPATIAL LAND-USE PROJECTION


INTRODUCTION


This chapter deals with the structure, testing, and verification of the land-use projection models that were
developed and tested. The development of the full spatial projection model proceeded in three major steps.
First, an initial multivariate model was developed. The model was then refined by improving its logical
structure. Finally, the model was further refined on the basis of the results of spatial-change replications.

The development of the landscape modeling data set formed the nucleus of the initial spatial projection
... from consideration for reasons of suitability and availability.
The initial projection model therefore consisted of all land-use change classes and landscape variables.
This model was used as a benchmark for further refinements and analyses.

Detailed examination of the initial model run showed flaws in its logical structure, especially the projection
of changes into infeasible combinations that ignored a priori knowledge of prior land use. Therefore, the
second phase of spatial-model development focused on improving the logical structure of the model to
restrict its projection to those changes that could actually occur, based on their current land use.

Modeling refinements were based on the testing of alternative formulations on the basic data set. The chief
... ability to adequately replicate its own training or calibration
data. This was accomplished by using the model to project 1970 land-use changes on the basis of
statistics of the known 1963 to 1970 changes. The various spatial projections tested were evaluated by
examining the total number of changes that were correctly classified over the total number tested. Each
projection was made using the training set sample of known changes on a purely statistical basis, and only
... individual cell revealed in a classification accuracy


DETERMINATION OF RECENT LAND-USE DYNAMICS 

Projections for future land use in the Denver Metropolitan Area were based on observations of the changes
that occurred in the area in the recent past. This required the overlay onto the landscape model of accurate,
detailed current and near-past land-use patterns as described in Chapter 2. Remote sensing imagery with
accurate interpretation provided the data for the land-use modeling process. It will be subsequently shown
how these inputs can be obtained and overlaid in a timely and accurate fashion through computer analysis
of Landsat multispectral digital imagery. However, the development and initial testing of the land-use
modeling process used carefully prepared, accurate land-use maps interpreted from low-altitude black-and-
white airphotos for both a current (1970)* and a past (1963) date. A uniform land-use classification scheme
covering 24 land uses was first adopted (table 1). As described earlier, a single photointerpreter analyzed


*When this effort was first undertaken in 1972, 1970 was the most current airphoto data available.
the large collection of airphotos for each date, annotating the location of each of the 24 land uses to a 4-ha
(10-acre) resolution. The two interpretations on the airphotos were then transferred to the sixteen 1:24,000-
scale map overlays covering the site. The 4-ha (10-acre) dot-grid patterns were imposed on these map overlays,
and the 24-class land-use maps were tabulated and overlaid onto the digital landscape model (Reference 1).

Recent changes in land use were computed and displayed from the 1963 and 1970 land-use data planes
overlaid in the model. This employed the cell-by-cell comparison of the 24 land uses interpreted for the two
different dates, which yielded a third data plane that recorded the changes in land use between the two dates.

A direct visual display of this new plane can illustrate its contents by highlighting the areas of gain or loss of
each land use (figures 40 and 41). Summation of the cells that changed between the two dates for each of
the 24 land uses provided a quick insight into those categories that were rapidly changing during the 7-year
interval (table 3).
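
The cell-by-cell comparison that produces the change data plane can be sketched in a few lines. The tiny 3 x 3 planes are hypothetical, and the way each change is encoded (earlier code times 100 plus later code) is an assumption made only for this illustration; the land-use codes 11 (residential), 19 (open and other urban), and 21 (cropland and pasture) are those used in the text.

```python
import numpy as np

ACRES_PER_CELL = 10

# Hypothetical 1963 and 1970 land-use data planes (codes 11, 19, 21).
lu_1963 = np.array([[21, 21, 19],
                    [21, 11, 19],
                    [21, 21, 11]])
lu_1970 = np.array([[21, 11, 19],
                    [11, 11, 11],
                    [21, 21, 11]])

# Third data plane: which cells changed, and from what to what.
changed = lu_1963 != lu_1970
change_plane = np.where(changed, lu_1963 * 100 + lu_1970, 0)  # 0 = no change

# Summation of changed cells per 1963 land use, converted to acres lost.
codes, lost_cells = np.unique(lu_1963[changed], return_counts=True)
lost_acres = dict(zip(codes.tolist(), (lost_cells * ACRES_PER_CELL).tolist()))
```

A display such as figures 40 and 41 would then shade the cells of `change_plane` whose earlier code matches the land use of interest.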

The detailed cell-by-cell comparisons of the land-use type for each of the 36,864 cells on each date also
provided a simple matrix that contained the number of 4-ha (10-acre) cells of each of the 24 land uses of the
earlier date (1963), which converted to another land use by the second date (1970) (table 4). This matrix
tabulation of recent changes in land use provided the basis for computing the tendency for additional change
in the near future. These measured tendencies were the essence of the initial land-use trend model when
converted into a probability transition matrix or a square two-dimensional array of probabilities of change arranged
in rows and columns.
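
A minimal sketch of the matrix tabulation follows. It assumes, for illustration only, that land uses have been renumbered as consecutive state indices 0, 1, 2, ...; the function name `transition_counts` and the six-cell example are hypothetical.

```python
import numpy as np


def transition_counts(before, after, n_states):
    """Cross-tabulate two land-use planes: entry [i, j] is the number of
    cells in state i on the earlier date and state j on the later date."""
    counts = np.zeros((n_states, n_states), dtype=int)
    # np.add.at accumulates correctly even when index pairs repeat.
    np.add.at(counts, (before.ravel(), after.ravel()), 1)
    return counts


# Hypothetical planes flattened to six cells, three states.
before = np.array([0, 0, 0, 1, 1, 2])
after = np.array([0, 0, 1, 1, 1, 2])

counts = transition_counts(before, after, n_states=3)
unchanged_cells = int(np.trace(counts))  # principal diagonal = no change
```

Entries on the principal diagonal count cells that kept their land use, as in table 4.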

The assumption that future changes in land use can be measured in terms of those that occurred in the recent
past permitted a simple projection of the future trends in land use (References 8 and 26). This assumption
does not truly represent the evolution of real-world land use, which is constantly subjected to new and often
unanticipated stimuli. However, assuming no change in practices from the past, the techniques for projecting
future land use must be perfected before the impact of new, unmeasured, and unobserved trends can be
incorporated into the process.


MARKOV LAND-USE TREND MODEL 

The Denver Metropolitan Area was photointerpreted into a finite number of observable states or land classes
that could be numbered 1, 2, 3, ..., m. A given mapping cell originally classified as other barren land, for
example, might be considered as state i; state j might denote open and other urban land, and so forth, with
state m (perhaps urban residential land use) representing the last state in the land-use model. In this study,
the total number of states (m) was equal to 34, the number of second-order land classification types in the
USGS Circular 671 system. Although only 21 classes were found in the Denver area during
photointerpretation and 13 land-use classes were absent in a geographical sense (e.g., tundra), the full 34 classes were
retained for a full conceptual application elsewhere.


The probability transition matrix contains the average probability that each of the 34 land uses will remain
the same or change to some other land use over the time represented by the dates of the two input land-use
First display:

BLACK = 1963 urban open space (code 19) that was converted to other land uses by 1970.
GRAY = 1963 cropland and pasture agricultural land (code 21) that was converted to other land uses by 1970.
WHITE = All other land uses.

Second display:

BLACK = 1963 cropland and pasture agricultural land (code 21) that was converted to other land uses by 1970.
GRAY = 1963 urban open space (code 19) that was converted to other land uses by 1970.
WHITE = All other land uses.

FIGURES 40 AND 41. LOSS IN EMBEDDED OPEN SPACE BETWEEN 1963 AND 1970 ... AGRICULTURAL LAND THAT
WAS CONVERTED INTO OTHER LAND USES (in black). The amount and characteristics of the acreages converted
can be found in tables 3 and 4. The microfilm display of each 10-acre square cell maps the 24- by 24-mile
Denver Metropolitan Area.
TABLE 3

NET CHANGES IN 1963 DENVER LAND USE RELATIVE TO 1970. Specific net transitions, such as those illustrated in
figures 40 and 41, that occurred between various specific land-use classes are given here. Computer comparison of the
land-use data planes illustrated in figures 13 through 16 provided these simple comparisons.


                                                      1963        1970       Net gain (+)
Land-Use Type                                        Acreage     Acreage     or loss (-)

Residential                                           64,210      70,500      + 6,290
Commercial and services                               11,020      11,820      +   800
Industrial                                             8,870      10,610      + 1,740
Extractive                                             4,630       6,360      + 1,730
Transportation, communications, and utilities          7,290       8,650      + 1,360
Institutional                                         31,250      31,590      +   340
Strip and clustered development                       13,500      16,550      + 3,050
Mixed urban                                               40           0      -    40
Open and other urban                                  37,410      35,620      - 1,790
  Urban subtotal                                     178,220     191,700      +13,480

Cropland and pasture                                 160,090     146,240      -13,850
Orchards, groves, and other horticultural areas           60          60            0
Livestock feeding operations                              20          20            0
Other agricultural land                                  330          70      -   260
  Agricultural subtotal                              160,500     146,390      -14,110

Deciduous forest land                                    180         180            0

Streams and waterways                                    960       1,010      +    50
Lakes                                                  5,930       6,410      +   480
Reservoirs                                             1,580       1,750      +   170
Other water                                               50          50            0
  Water subtotal                                       8,520       9,220      +   700

Vegetated nonforested wetland                          1,710       1,710            0

Sand other than beaches                                  640         520      -   120
Other barren land                                     18,870      18,920      +    50
  Barren subtotal                                     19,510      19,440      -    70

Grand Total                                          368,640     368,640      ±16,060










































TABLE 4 

CHANGES IN DENVER LAND USE, 1963 TO 1970. Entries show the number of 10-acre cells of each 1963 land use that
converted to another land use by 1970. A blank denotes that no conversion occurred between the respective land uses during
the period covered. Entries along the principal diagonal represent the number of cells of the land use that did not change to
another land use between 1963 and 1970.



data planes. These stochastic elements are summarized in a matrix of the form:

\[
P =
\begin{bmatrix}
p_{1,1}  & p_{1,2}  & \cdots  & p_{1,34}  \\
p_{2,1}  & p_{2,2}  & \cdots  & p_{2,34}  \\
\vdots   & \vdots   & p_{i,j} & \vdots    \\
p_{34,1} & p_{34,2} & \cdots  & p_{34,34}
\end{bmatrix}
\]

where $p_{i,j}$ is the average probability that a given 4-ha (10-acre) cell in state $i$ as of 1963 would evolve into
state $j$ in 1970. Alternately, a briefer form is

\[
P = \{p_{i,j}\} \quad \text{for } i = 1, 2, \ldots, m \text{ and } j = 1, 2, \ldots, m
\]

where the braces indicate that $p_{i,j}$ is a typical element of the stochastic matrix, $P$, the limits of $i$ and $j$ being
$m$. The element $p_{i,j}$ is called the $i,j$th element, and the first subscript refers to the element's row and the
second subscript, to the element's column.


The probability transition matrix, $P = \{p_{i,j}\}$, between 1963 and 1970 for the Denver Metropolitan Area, is
reproduced with nonoccurring, or zero land-use change, probabilities deleted for more convenient reference
(table 5). The elements on the principal diagonal of the probability transition matrix, $p_{i,i}$, are significant in
that they represent the proportion of the earlier (1963) land use that remained in the same land use in the
later (1970) date. Elements not on the principal diagonal are transition probabilities (or proportions) for a
given land use to change in the given time interval. All rows in the matrix are stochastic vectors, that is, the
entries sum to one across any row, or in dot notation,

\[
p_{i\cdot} = \sum_{j=1}^{m} p_{i,j} = 1
\]
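
The conversion of a matrix of cell counts into a probability transition matrix whose rows sum to one can be sketched as below. The counts are hypothetical. The treatment of land uses that were absent in the study area (their rows are set here to "remain unchanged with probability one") is an assumption of this sketch; the source does not say how the 13 absent classes were handled.

```python
import numpy as np


def transition_probabilities(counts):
    """Divide each row of a transition-count matrix by its row total so
    that every row becomes a stochastic vector."""
    counts = np.asarray(counts, dtype=float)
    row_totals = counts.sum(axis=1, keepdims=True)
    probs = np.zeros_like(counts)
    np.divide(counts, row_totals, out=probs, where=row_totals > 0)
    # Assumption: a state with no cells on the earlier date keeps itself.
    absent = row_totals.ravel() == 0
    probs[absent, absent] = 1.0
    return probs


# Hypothetical counts for three states; the third state is absent.
counts = [[90, 10, 0],
          [0, 50, 0],
          [0, 0, 0]]

probs = transition_probabilities(counts)
row_sums = probs.sum(axis=1)
```

Each diagonal entry of `probs` is then the proportion of the earlier land use that stayed in the same land use.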

The probability transition matrix was input to a Markov projection process that operated on the relative
amounts of each of the 21 land uses present in the two most current land-use data planes (1963 and 1970)
to project future trends in land use (figure 42). Fifteen 7-year simulation periods were run from 1963 to
2068 and yielded (table 6) rapid decreases in two functional open-space categories: open and other urban
land (code 19) and cropland and pasture agricultural land (code 21). These two land uses in the Markov
model declined from [illegible] ha (37,410 and 160,090 acres), respectively, in 1963 to [illegible]
ha (15,340 and 43,180 acres), respectively, in 2068. Urban residential land use (code 11) nearly
doubled, increasing from 26,000 ha (64,210 acres) in 1963 to 50,700 ha (121,600 acres) in 2068.
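
The Markov projection itself amounts to repeatedly multiplying the vector of land-use acreages by the constant transition matrix. The sketch below uses a hypothetical two-state matrix and hypothetical starting acreages; only the fifteen 7-year periods from 1963 to 2068 are taken from the text.

```python
import numpy as np


def markov_project(acres0, P, n_periods):
    """Project acreages forward; row t of the result is the acreage
    vector after t periods, with row 0 the starting acreages."""
    history = [np.asarray(acres0, dtype=float)]
    for _ in range(n_periods):
        # Acreage in each state is redistributed according to P.
        history.append(history[-1] @ P)
    return np.vstack(history)


# Hypothetical two-state example: open land loses 10 percent per period
# to urban land, and urban land never reverts.
P = np.array([[0.9, 0.1],
              [0.0, 1.0]])
acres0 = [1000.0, 500.0]

trajectory = markov_project(acres0, P, n_periods=15)
years = 1963 + 7 * np.arange(16)  # 1963, 1970, ..., 2068
```

Because every row of `P` sums to one, the total acreage is conserved at each step; only its distribution among land uses changes.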

The Markov trend model was intended to augment the land manager's knowledge and experience of the
future implications of current actions and to permit more thorough searches for "best" ecomanagement
decisions. Although these current trends can be mathematically projected as far into the future as desired,
only the first few time periods are reliable. A major limitation of all such models is the assumption of
homeostasis, in that the probability transition matrix of conversions, $P = \{p_{i,j}\}$, of land from type $i$ to
type $j$ is invariant and constant during all future simulated time periods. A more serious shortcoming is
the fact that it was simply a descriptive trend model, and, although it indicated general trends over time,
it did not provide spatial land-change information on a point-by-point basis. Finally, only gross interpretations
can be made of the long-term evolution of land use from this model. New and unanticipated controls on the
use of the land can quickly occur and impact future urban growth and attendant land-use patterns.


FIGURE 42. PREDICTION OF FUTURE TRENDS IN THE AMOUNT OF
OPEN SPACE AND COMPETING LAND USE IN THE DENVER METROPOLITAN
AREA. A constant matrix of transfers, $P = \{p_{i,j}\}$, from land use $i$ to
land use $j$, was assumed. Acreages displayed relate to the original area of 24 by
24 statute miles, or 576 square statute miles.


MULTIVARIATE LAND-USE SPATIAL PREDICTION MODEL

The land-use modeling effort was next addressed to the design and structure of a spatial/temporal land-use
change model that could be used for map predictions. The Markov chain process, in which the probability
of transition from one state to another is the conditional probability, $p(j|i)$, was a useful point of departure.
However, its simplistic matrix multiplication of land-cover types was completely inappropriate for spatial
analysis and prediction. Like most regionalization problems, this next level of complexity must
characteristically be addressed to more numerous geographic observations where each consists of measurements on a
huge number of variables. It was therefore apparent that any spatial land-use modeling would be impossible
without commitment to a full multivariate statistical approach. This approach was largely a classification
exercise and employed the techniques of modern numerical taxonomy (Reference 27).




































Relationship of Land-Use Change to Landscape Variables

... by its similarity to other class members and some level of differentiation from nonclass members. Likewise,
... quantitatively defined by its associated landscape parameters for the Denver Metropolitan Area. It was
assumed that they exhibited some similarity to other cells in the change class and differentiation from
nonchanged cells.

The important assumption that cells of land use that have undergone similar changes in a particular period have
... 1,692 and 1,693.4 meters (5,550 and 5,556 feet) for the 1963 and 1970 land use. ... elevation of the
agricultural land loss during this period was 1,670 m ... in the histogram of the loss versus elevation ...
of the agricultural ... 1963 and 1970 was into urban land uses (table 4): 5,414 of 5,607 total hectares ...
were lost predominantly at the lower, more suitable elevations. It was clear that the lower agricultural lands
... suburbanization processes. Thus, a multivariate rationale for quantifying the mathematical properties of
land-use change types for all of their physiographic, socioeconomic, and transportation variables provided a
basis for a spatial/temporal land-use projection model.

Spatial Modeling Concepts

One of the most widely used multivariate procedures is discriminant analysis (Appendix A). A priori data
in the form of selected samples of known identity are extracted from the entire sample space and are used
as training data to structure the discriminant function. This discriminant function is then used to classify
... The discriminant function is a powerful
statistical tool that can be applied completely free of statistical knowledge or assumptions. This overall
approach is commonly referred to as distribution-free or nonparametric classification (Reference 28). All
classification algorithms in discriminant-function analysis can be reduced to either fixed or varying
hyperplanes of pattern or feature spaces. These hyperplane boundaries may or may not have been defined in the
context of known statistical distributions (Reference 29).

The linear discriminant function transforms an original set of sample measurements into a single discriminant
score. This score, or transformed variable, represents the sample's position on the line defined by the linear
discriminant function. Consequently, the discriminant function can be visualized as a method of telescoping
a multivariate problem into a univariate, linearly ordered situation.

Discriminant-function analysis seeks to find a transform that gives the maximum ratio of the difference
between a pair of class multivariate means to the multivariate variance within the two classes in the sample
space. The one optimum orientation is sought along which the two clusters have the greatest separation while
simultaneously minimizing the spread or inflation of the distribution of each cluster. An adequate separation
between groups A and B for the two-dimensional case cannot be made with either variable X1 or X2 (figure
44). However, it is feasible to find an orientation along which the two clusters are separated the most and
inflated the least, with the coordinates of this axis of orientation being the linear discriminant function.
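
The two-group case described above can be illustrated with the classical Fisher linear discriminant, in which the coefficients are obtained by solving the pooled within-group scatter matrix against the difference of the group means. This is a minimal sketch, not the stepwise multigroup program used in the study; the function name `fisher_discriminant` and the eight sample points are hypothetical, chosen so that groups A and B overlap along both X1 and X2 yet separate along the discriminant axis.

```python
import numpy as np


def fisher_discriminant(A, B):
    """Return coefficients w and a midpoint cutoff for two groups whose
    rows are observations and whose columns are variables."""
    mean_a, mean_b = A.mean(axis=0), B.mean(axis=0)
    # Pooled within-group scatter (sums of squares and cross products).
    Sw = (A - mean_a).T @ (A - mean_a) + (B - mean_b).T @ (B - mean_b)
    # Direction that maximizes mean separation relative to within spread.
    w = np.linalg.solve(Sw, mean_a - mean_b)
    cutoff = 0.5 * (mean_a + mean_b) @ w
    return w, cutoff


# Hypothetical bivariate groups that overlap on each variable alone.
A = np.array([[1.0, 2.2], [2.0, 2.8], [3.0, 4.1], [4.0, 4.9]])
B = np.array([[2.0, 0.9], [3.0, 2.2], [4.0, 2.8], [5.0, 4.1]])

w, cutoff = fisher_discriminant(A, B)
scores_a = A @ w  # discriminant scores: one number per observation
scores_b = B @ w
```

Projecting each observation onto `w` reduces the two measurements to a single score, and comparing that score with `cutoff` assigns the observation to group A or group B.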


CLASSIFY was the modified BMD07M (Reference 18) stepwise linear discriminant-analysis program used in
this final phase of spatial land-use projection. Linear discriminant functions were computed by entering
ancillary landscape variables with a largest F-value-to-enter criterion. The a posteriori probabilities of each
land-use case belonging to each of the possible change combinations in land use between 1963 and 1970
were calculated using these functions and either a priori or equal (1/n) group probabilities. Additional
programming changes to provide tape input and output features, a more useful classification matrix, and a timing
function to evaluate the cost-effectiveness of each additional stepwise variable resulted in the present
CLASSIFY program.
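
One common way of turning linear discriminant functions into a posteriori group probabilities is sketched below. It is not taken from the CLASSIFY listing: the sketch assumes the standard formulation in which all groups share a pooled covariance matrix, each group has a linear classification function, and the prior (either supplied or equal, 1/n) enters as its logarithm. The function name, group means, and covariance are hypothetical.

```python
import numpy as np


def posterior_probabilities(x, means, pooled_cov, priors=None):
    """A posteriori probability of each group for one observation x,
    using linear classification functions with a pooled covariance."""
    means = np.asarray(means, dtype=float)
    n_groups = means.shape[0]
    if priors is None:
        priors = np.full(n_groups, 1.0 / n_groups)  # equal (1/n) priors
    priors = np.asarray(priors, dtype=float)
    # Row g holds the coefficients inverse(pooled_cov) @ mean_g.
    coef = np.linalg.solve(pooled_cov, means.T).T
    const = -0.5 * np.sum(coef * means, axis=1) + np.log(priors)
    scores = coef @ np.asarray(x, dtype=float) + const
    scores -= scores.max()  # guard against overflow in exp
    weights = np.exp(scores)
    return weights / weights.sum()


# Hypothetical two-group, two-variable example.
means = [[0.0, 0.0], [2.0, 0.0]]
pooled_cov = np.eye(2)

post_equal = posterior_probabilities([1.0, 0.0], means, pooled_cov)
post_prior = posterior_probabilities([1.0, 0.0], means, pooled_cov,
                                     priors=[0.8, 0.2])
```

At a point equidistant from both group means the data carry no information, so the posterior simply reproduces whichever prior was supplied.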


Initial Spatial Modeling Test 


The data planes of the earlier (1963) and current (1970) land use and the derived 1963 to 1970 land-use
changes were overlaid in the landscape model with the 34 ancillary variables. The 2,039 of the 36,864 cells
in the model that were observed to have changed between the two land-use dates provided the basis for
determining how the landscape variables correlated with changes in land use in a multivariate sense. The 2,039
cells that made a transition from one land use to another over the 7-year test period provided a group of
observations for each type of change that occurred. These observations were used as a statistical sample to
model how the changes in land use will proceed in the future based on physiographic, socioeconomic, and
transportation landscape variables.


The Denver Metropolitan Area was mapped with 21 land-use classes on both the initial (1963) and subsequent
(1970) dates. Thus, 21² − 21, or n(n − 1), change combinations were possible. However, only 38 of these possible
420 changes actually occurred, as the earlier probability transition matrix shows (table 5). Many kinds of
possible land-use changes either seldom or never occur (e.g., the backward conversion from some higher state
of land use, such as industrialized land, back to a lower state, such as agricultural land). All test cells that
had undergone a specific transition were assembled so that the 2,039 changes observed between the two dates
were grouped into the 38 observed combinations. The statistical technique of discriminant analysis was
applied to these observations to form a simulation model by training it to recognize the range, in a
multivariate sense, of each landscape variable associated with each of the 38 change combinations.
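
The counting and grouping described above can be sketched in a few lines. This is an illustrative sketch only, not the study's software: the function names and the six-cell toy data are assumptions introduced here, and the land-use codes are used only as labels.

```python
from collections import Counter


def count_possible_changes(n_classes):
    """Number of ordered (from, to) pairs with from != to, i.e. n(n - 1)."""
    return n_classes * (n_classes - 1)


def observed_changes(land_use_t0, land_use_t1):
    """Tally cells by (earlier code, later code), skipping unchanged cells."""
    return Counter(
        (a, b) for a, b in zip(land_use_t0, land_use_t1) if a != b
    )


# Hypothetical toy data for six cells (not the Denver data planes).
lu_1963 = [21, 21, 19, 21, 14, 19]
lu_1970 = [11, 21, 11, 11, 14, 12]

possible = count_possible_changes(21)          # 21 classes, as in the text
changes = observed_changes(lu_1963, lu_1970)   # groups of change observations
```

Each key of `changes` plays the role of one change combination (for example, 21/11), and its count is the size of the group of observations available for training.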


A single, simplified test model will serve to clarify this approach. The first spatial land-use projection model
used all 38 change combinations, together with the landscape variables and equal (1/n) class probabilities.
The seven land-use data variables of the 34 variables overlaid in the landscape model were excluded from the
independent variables in this and all subsequent analyses so that they represented the results being modeled.
Linear discriminant analysis combined the remaining 27 landscape variables for each change point by
constructing a linear discriminant function of the form


$$Y = a_1 x_1 + a_2 x_2 + \cdots + a_n x_n \qquad (n = 27)$$

where $x_1, x_2, \ldots, x_n$ were the landscape variable values, and $a_1, a_2, \ldots, a_n$ were coefficients
computed for the linear compound that minimized misclassification of the 2,039 changed cells. The discriminant
analysis step printed a classification matrix showing how the 2,039 changed cells were classified into their
respective 38 change combinations. The evaluation of this and each subsequent model's value was based on a
figure of merit (FOM), calculated as the number of correct classifications along the principal diagonal of the
classification matrix divided by the total number of changed cells and expressed as a percent.

FIGURE 44. SIMPLE LINEAR DISCRIMINANT-FUNCTION DIAGRAM. This plot of two bivariate distributions shows an
overlap between groups A and B along both variables. Discriminant-function analysis determines a transform
that gives the minimum ratio of the difference between a pair of group multivariate means to [text illegible
in the source scan]. The orientation is computed along which the two clusters become distinguishable by
projecting members of the two populations onto the discriminant function line (Reference 30).

The FOM for the first model was 41.9 percent (table 7). Overall, it correctly identified the expected
change in land use of 854 of the 2,039 observed cells that had changed, based upon the 27 landscape
variables (table 8). Because a choice was made from 38 different land-use change combinations, an
accuracy of 1/n (n = 38 groups) times 100 percent = 2.6 percent would be expected from random
assignment of the 2,039 changed cells to 38 combinations. Although, by comparison, the 41.9-percent
figure of merit was far from perfect, it clearly indicated the feasibility of this spatial land-use projection
approach.
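
The figure of merit and the chance baseline quoted above are simple ratios. The following is a minimal sketch of that arithmetic; the function names and the 3-by-3 toy matrix are assumptions made for illustration and are not taken from the CLASSIFY program.

```python
def figure_of_merit(matrix):
    """Percent of cases on the principal diagonal of a square classification matrix."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return 100.0 * correct / total


def chance_accuracy(n_groups):
    """Percent accuracy expected from random assignment to n equally likely groups."""
    return 100.0 / n_groups


# Hypothetical 3-class matrix: rows are observed groups, columns are assigned groups.
toy = [
    [8, 2, 0],
    [3, 5, 2],
    [1, 1, 8],
]
toy_fom = figure_of_merit(toy)            # 21 of 30 cases lie on the diagonal

# Values quoted in the text for the first model.
first_model_fom = 100.0 * 854 / 2039      # rounds to 41.9 percent
baseline = chance_accuracy(38)            # rounds to 2.6 percent
```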

The entry order of the change variables may be of some interest to planners. Examination of the order
of the “F-value to enter” also provides insight into the relative value of the landscape variables in
predicting land-use change (table 9). The distance to freeways was found to be an influential factor and
was selected early by the classifier. However, the proximity to the periphery of the urban built-up
area was even more important. Average number of families per acre, population density per acre, and
topographic elevation were also important descriptors of change for the Denver Metropolitan Area.

Tests of Other Generic Spatial Models 

An examination of the initial test classification matrix (table 7) showed that it failed to exploit the known
a priori 1963 land use of all change cells. For example, the land-use change code 21/19, representing 1963
cropland and pasture agricultural land (code 21) changing to open and other urban land (code 19) in 1970,
could be classified in any of 37 other remaining land-change combinations. This specific change, in fact,
was predicted to occur in 27 other land-use change combinations, 15 of which were inconsistent with the
fact that the 1963 initial land use was cropland and pasture agriculture (code 21). Because the initial or
starting land use would always be known, the choice of a type of change must be made to be consistent
with the known, initial land use.

Six additional spatial land-use projection models incorporating the logic of a known, beginning land use
were tested with varying degrees of improvement over the initial model (table 10). The second and third
models were derivatives of the initial full 38-class model restricted to the 1963 cropland and pasture
agricultural change cases (1,437 cells), and represented 13 of the 38 change combinations under alternative
a priori and equal class occurrence probabilities.

The second model correctly reclassified 822 of the 1,437 agricultural change cases for a 57.2-percent FOM
(figure 45) with a priori class probabilities (tables 11 and 12), a significant increase over the comparable 854
correct cases from among all 2,039 cells (41.9 percent) achieved in the initial model. The third model used
the same 1,437 agricultural change cells but differed in that it used equal (1/n, n = 13) class probabilities and
yielded 756 correct cases for a 52.6-percent FOM.

The only difference between the second and third models was the use of the a priori versus equal-class
occurrence probabilities, respectively, which yielded a slight classification advantage to the second model.
Equal-class likelihood was used where the actual proportion of the modeled land-use (1970) changes was
unknown. The a priori probabilities of the second model could be either estimated by remote sensing or
extrapolated from mathematical simulation models, such as the Markov trend model. Here, the a priori
probabilities were simply derived as the ratio of the total cases of each change class to the grand total of
change cases.
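
The two probability schemes can be illustrated as follows. This is a hedged sketch rather than the CLASSIFY code: it assumes the common formulation in which the natural log of the class probability is added to each class's linear discriminant score before the largest is chosen, and the class counts and scores below are invented for the example.

```python
import math


def a_priori_probabilities(class_counts):
    """Ratio of the cases in each change class to the grand total of change cases."""
    total = sum(class_counts.values())
    return {k: n / total for k, n in class_counts.items()}


def equal_probabilities(class_counts):
    """Equal (1/n) probability for each of the n change classes."""
    n = len(class_counts)
    return {k: 1.0 / n for k in class_counts}


def classify(scores, priors):
    """Choose the class with the largest discriminant score plus log prior (assumed form)."""
    return max(scores, key=lambda k: scores[k] + math.log(priors[k]))


# Hypothetical counts and prior-free scores for one cell and three change classes.
counts = {"21/11": 60, "21/19": 30, "21/52": 10}
scores = {"21/11": 1.0, "21/19": 1.2, "21/52": 1.5}

with_equal = classify(scores, equal_probabilities(counts))
with_a_priori = classify(scores, a_priori_probabilities(counts))
```

In this toy case the equal-probability rule picks the rare class with the highest raw score, whereas the a priori rule shifts the decision toward the most common change class.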
TABLE 7

[Title illegible in the source scan.] 10-Acre Cells Classified Into 38 Second-Order Land-Use Combinations.
The 27 [text illegible] cells were subjected [text illegible] accuracy. Land-use data were not used, except for
identifying the cells that had changed. On the axes, the numerator is the 1963 land use, and the denominator
is the 1970 land use. Numeric land-use codes are identified in table 1.

[Classification matrix of the 38 change combinations; the cell entries are not recoverable from the source scan.]
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TABLE 8 

[Caption largely illegible in the source scan. Legible fragments: "... numerical utility ... SPATIAL LAND-USE
PROJECTION MODEL. The ... in order. The F-values reported were ..."]

Step
Number    Ancillary Landscape Variable Entered

  1       Average number of families per acre
  2       Built-up urban-area minimum distances
  3       Population density per acre
  4       Topographic elevation
  5       Freeway minimum distances
  6       Median housing-unit rent
  7       Composite minor-road minimum distances
  8       Average number of year-round housing units per acre
  9       Median housing-unit value
 10       Total occupied housing units
 11       Average number of housing units per acre
 12       Total one-car families
 13       Topographic slope
 14       Freeway interchange minimum distances
 15       Landsat-1 image insolation
 16       Total three-car families
 17       Total year-round housing units
 18       Total census-tract acreage
 19       Total two-car families
 20       Total vacant housing units
 21       Total families
 22       Average number of cars per family
 23       Composite major-road minimum distances
 24       Topographic aspect
 25       Surficial geology
 26       Total population
 27       1969 mean family income

[The F-value-to-enter column is illegible in the source scan.]


TABLE 10

[Title partly illegible in the source scan] ... CHANGE IN THE DENVER METROPOLITAN AREA, 1963 TO 1970. This
table is based on predicting the change in land use for the cells that had undergone a change during the test
period. The change was predicted by discriminant analysis from the landscape variables. The accuracy was
based on the number of cells whose future or 1970 land use was correctly predicted. [?] marks an entry that
is not legible in the source scan.

                                                                   Total     Total     Average
Model                                                              Sample    Cells     Accuracy
Number  Model Description                                          Cells     Correct   (percent)

  1     Full 38-change class model with 1970 equal-change          2,039     854       41.9
        probabilities
        (Subportion of thirteen cropland and pasture               (1,437)   ([?])     (44.1)
        agricultural changes)

  2     Second composite 38-change class model with 1970           2,039     1,208     59.2
        a priori change probabilities
        (Subportion of thirteen cropland and pasture               (1,437)   (822)     (57.2)
        agricultural changes)

  3     Thirteen cropland and pasture agricultural changes of      1,437     756       52.6
        38 total change classes with 1970 equal-change
        probabilities

  4     Pure 13 cropland and pasture agricultural change           1,437     [?]       55.0
        classes with 1970 a priori change probabilities

  5     Pure 13 cropland and pasture agricultural change           1,437     747       52.0
        classes with 1970 equal-change probabilities

  6     Fourteen cropland and pasture agricultural                 2,347     1,241     52.9
        change/nonchange classes with 1970 equal-change
        probabilities

  7     Fourteen cropland and pasture agricultural                 2,347     [?]       48.1
        change/nonchange classes with 1970 a priori change
        probabilities


The fourth and fifth models were similar to the second and third models, respectively, in that they used the
1,437 agricultural change cases representing thirteen change combinations, with a priori and equal-class
occurrence probabilities, respectively. Unlike the second and third models, however, the sample data file
contained only the 13 agricultural change cases, whereas the three previous models derived their
discriminant functions from the full 38-class change combinations.

The fourth model correctly reclassified [number illegible in the source scan] agricultural change cases for a
55.0-percent FOM (figure 45) [text illegible in the source scan] achieved in the comparable second model.

FIGURE 45. COMPARATIVE DISPLAYS OF THE SPATIAL PREDICTIONS OF THREE LAND-USE MODELS EMPHASIZING
ALL CHANGES FROM AGRICULTURAL LANDS (in black). (a) Ground-truth map derived from the 1970 aerial
photointerpretation showing only the first-order agricultural land use. (b) Change detection map derived from
a point-to-point comparison of the 1963 and 1970 land-use data planes. (c, d, and e) Landscape-modeled maps
showing only those cells predicted to evolve from agricultural land use as of 1970.

Panel titles: (a) All 1970 agricultural land use. (b) All observed changes from agricultural land from 1963 to
1970. (c) Thirteen agricultural change classes predicted to have changed from agricultural land as of 1970
(second model). (d) Thirteen agricultural change classes predicted to have changed from agricultural land as
of 1970 (fourth model). (e) Fourteen agricultural change classes predicted to have changed from agricultural
land as of 1970 (sixth model).

TABLE 11

[Title illegible in the source scan.] The initial model was partitioned into seven runs, corresponding to each
of the basic 1963 land-use classes. Blank entries indicate unfeasible change combinations; dashes indicate that
no cells [text illegible] the feasible combinations. Land-use data were not used except to select the change
cells. [Land-use codes are] identified in table 1.

[Classification matrix by change code, arranged in blocks that include the changes from 1963 extractive land
use (code 14), from 1963 open and other urban land use (code 19), and from 1963 cropland and pasture
(code 21); the cell entries are not recoverable from the source scan.]

TABLE 12

ACCURACY OF PREDICTION OF FUTURE CHANGES IN LAND USE ON A CELL-TO-CELL OR SPATIAL [text illegible]
METROPOLITAN AREA. This table is based on predicting the change in land use from 1963 to 1970 for the
2,039 cells in the second landscape model that underwent a change during the test [period. The change] was
predicted by discriminant analysis from the landscape variables with reference to the [text illegible] on the
number of cells whose future (1970) land use was correctly predicted (table 11). [?] marks an entry that is not
legible in the source scan.

                                                         Total     Total     Prediction
1963 Land Use Predicted to Change                        Sample    Cells     Accuracy
to 1970 Land Use                                         Cells     Correct   (percent)

Extractive to:
  Residential                                            [?]       [?]       66.7
  Industrial                                             [?]       [?]       100.0
  Cropland and pasture                                   [?]       [?]       100.0
  Lakes                                                  [?]       [?]       80.0

Institutional to:
  Lakes                                                  15        15        100.0

Mixed urban to:
  Strip and clustered development                        4         4         100.0

Open and other urban to:
  Residential                                            231       184       79.7
  Commercial and services                                53        20        37.7
  Industrial                                             45        24        53.3
  Extractive                                             33        27        81.8
  Transportation, communications, and utilities          51        20        39.2
  Institutional                                          [?]       7         [?]
  Strip and clustered development                        [?]       24        61.5
  Cropland and pasture                                   43        13        30.2
  Streams and waterways                                  1         1         100.0
  Lakes                                                  5         4         80.0

Cropland and pasture to:
  Residential                                            [?]       314       [?]
  Commercial and services                                27        6         22.2
  Industrial                                             123       42        34.2
  Extractive                                             [?]       [?]       61.8
  Transportation, communications, and utilities          72        36        50.0
  Institutional                                          23        2         8.7
  Strip and clustered development                        [?]       [?]       41.8
  Open and other urban                                   344       [?]       [?]
  Other agricultural land                                [?]       [?]       100.0
  Streams and waterways                                  4         3         75.0
  Lakes                                                  [?]       4         18.2
  Reservoirs                                             17        17        100.0
  Other barren land                                      5         3         60.0

Other agricultural land to:
  Residential                                            1         1         100.0
  Extractive                                             [?]       [?]       [?]
  Transportation, communications, and utilities          [?]       [?]       [?]
  Open and other urban                                   [?]       [?]       [?]
  Cropland and pasture                                   [?]       [?]       [?]
  Streams and waterways                                  [?]       [?]       [?]

Sand other than beaches to:
  Industrial                                             1         1         100.0
  Extractive                                             [?]       [?]       [?]
  Open and other urban                                   [?]       [?]       [?]

Total                                                    2,039     1,208     59.2


[Text illegible in the source scan] ... 910 cells, representing a nonchange class, together with the 1,437
[agricultural change cells; remainder of sentence illegible].

The sixth spatial model correctly reclassified 1,241 cases for a 52.9-percent FOM (figure 45). Only 437 of the
known agricultural change cases were correctly reclassified for a 30.4-percent accuracy, whereas 804 of the
sampled nonchange agricultural cases were correctly classified as unchanged for a substantial 88.4-percent
accuracy.

Unfortunately, although this error of 11.6 percent appeared tolerable, when extended to the original
population of 14,624 unchanged agricultural cells it represented about 1,700 cells (11.6 percent of 14,624)
erroneously predicted to undergo some change. This number was larger than the 1,437 known changes
observed for cropland and pasture agricultural land.

With a priori probabilities, the last model correctly [text illegible in the source scan] for a 48.1-percent FOM.


Proposed Advanced Spatial Model

The proposed advanced spatial model (figure 46) emerges as a synthesis of the Markov trend model [and the
discriminant model, applied to] all the landscape model to predict the next most likely change in land use for
[each 4-ha (10-]acre) cell. The Markov trend model provides the number of these cells that will [change in the]
next 7 [years]. After the discriminant model predicts the next change in land use and its posterior probability
for each cell in the landscape, the [land use at a] future time period can be determined by assembling all
changes of each given type from the [cells] constituting the entire landscape. The group of cells representing
each type of change [is then sorted by] their posterior probability of occurrence. The correct number of
transitions supplied by the Markov trend [model is then taken from the cells having the highest] posterior
probability at the top of each of the ordered lists. [The remaining cells are] assumed to be unchanged and can
be noted as such. The exact [location of each cell is given by] their respective rows and columns. The sorted
[remainder of sentence missing in the source scan].

FIGURE 46. PROPOSED COMBINATION MARKOV AND LINEAR DISCRIMINANT MODELS FOR IMPROVED
SPATIAL-CHANGE PREDICTION. The Markov trend model provides the correct number of change cells by type.
These can be selected from a sorted list of discriminant-computed posterior probabilities of change. The
selected cells can be assembled into a map of future land use. Spatially registered Landsat digital imagery
can serve as future land-use inputs in lieu of the 1963 to 1970 aerial photography.
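
The selection step described in the Figure 46 caption can be sketched as follows. This is only an illustration of the idea under stated assumptions, not the authors' implementation: the function name, the data layout (one predicted change type and posterior probability per cell), and the toy values are all hypothetical.

```python
def allocate_changes(candidates, markov_quota):
    """Select, per change type, the cells with the highest posterior probability.

    candidates   -- list of (cell_id, change_type, posterior); one entry per cell
    markov_quota -- dict mapping change_type to the number of cells the trend
                    model expects to make that change
    Returns a dict mapping each selected cell_id to its change_type; cells that
    are not selected are treated as unchanged.
    """
    by_type = {}
    for cell_id, change_type, posterior in candidates:
        by_type.setdefault(change_type, []).append((posterior, cell_id))

    selected = {}
    for change_type, entries in by_type.items():
        # Order each list from most to least probable change.
        entries.sort(key=lambda entry: entry[0], reverse=True)
        quota = markov_quota.get(change_type, 0)
        for _, cell_id in entries[:quota]:
            selected[cell_id] = change_type
    return selected


# Hypothetical 2-by-2 landscape; cell ids are (row, column) pairs.
candidates = [
    ((0, 0), "21/11", 0.9),
    ((0, 1), "21/11", 0.4),
    ((1, 0), "21/19", 0.7),
    ((1, 1), "21/11", 0.6),
]
quota = {"21/11": 2, "21/19": 1}
future_changes = allocate_changes(candidates, quota)
```

Because each cell id carries its row and column, the selected changes can be written straight back into a map of future land use.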
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CHAPTER 4 


LANDSAT LAND-USE CLASSIFICATION


INTRODUCTION


The first pair of Earth Resources Technology Satellites, designated Landsat-1 and -2, are providing
unexcelled opportunities to explore the utility of spacecraft remote sensing data for regional land-use mapping
and analysis. Remote sensing is the detection and evaluation of objects without any direct contact. A
number of devices collectively known as remote sensors have been developed for collecting and recording
the electromagnetic (EM) energy emitted, reflected, and scattered from terrestrial objects at a number of
different angles, frequencies, and polarizations.

The level of energy emitted, reflected, and scattered from objects of terrain varies with wavelength
throughout the EM spectrum. The spectral signature of an object by which it is both detected and recognized is
governed by the amount of energy transmitted to the remote sensor in the EM region in which that sensor
operates. Therefore, an identifying spectral signature can often be developed if the spectral energy observed
is partitioned into carefully selected bands for analysis. Complementary electromagnetic sensors operating
in adjacent spectral regions are commonly used for this purpose (Reference 1).

Whether by man, machine, or interactive man/machine symbiosis, remote sensing data analysis involves
pattern recognition, which categorizes phenomena into classes of interest from a set of measurements. The
recognition of patterns and the delineation of boundaries on an image are tasks that a photointerpreter does
quite well on a single image. Different land-cover and land-use types and their patterns are recognized through
the use of numerous contextual and inferential clues. These include the spatial relationships of tone, texture,
and size, shape, and geometry of objects, as well as spectral information obtained by given aerial film/filter
combinations.


Conversely, a collection of images of varying spectroradiance can be input to digital computers to adaptively
improve their numerical pattern recognition/classification efforts using either supervised or unsupervised
learning algorithms. Consequently, the output from either man or machine is a series of decisions yielding
a map on the nature and probabilistic relation of unknown patterns to known or learned patterns.

Although an all-inclusive automated remote sensing data-analysis system will not be possible for some time,
digital computers can perform a number of specialized tasks with a high degree of success. Although these
processing functions are relatively simple in a machine sense, they are not trivial in a manual interpretation
sense. When an interpretation problem demands the simultaneous comparison of multiple statistical patterns,
as in multispectral remote sensing, the human analyst's capability is severely limited, whereas the computer
does this without difficulty. Some specific justifications for developing automated image processing
techniques in remote sensing are:

• The demand for remote sensing inputs to real-time resource management decision-making is
rapidly growing.

• The rapid changes caused by population growth and technological development necessitate change
detection at short intervals, especially in swiftly urbanizing areas.

• The derivation of usable information from remote sensing data is increasingly needed.

• Weather and resource satellite programs are generating an overwhelming volume of data.

• Short supplies of skilled image-analyst manpower militate against conventional manual image
processing.

• Simple human functions can be duplicated by digital computers to achieve higher input/throughput
rates, higher degrees of objectivity, and higher accuracies.

• The liberation of human analysts from routine work through the use of computers makes them
available for more challenging tasks.

• Computers can easily manipulate digital input data of high dimensionality, thereby extending the
realm of image processing beyond limited human capabilities.

Pattern recognition is concerned with the classification of input data provided in the form of measurements. 

In a remote sensing context, pattern recognition deals with the classification of pictorial/numeric data de- 
scribing terrestrial phenomena and consists of five principal steps: 

1. Input of the measured patterns (e.g., multispectral imagery) 

2. Pattern preprocessing (e.g., calibration and geometric restitution) 

3. Feature extraction (e.g., training set selection) 

4. Decision/classification (e.g., identification of land use)

5. Classification output (e.g., map display) 


Only the first and fifth pattern recognition steps are reasonably discrete. Steps 2, 3, and 4 overlap to a
greater or lesser extent, depending on the processing problem and the specific procedures used. However,
the sequence of these steps is generally the proper order of the analytical phases.

A dramatic new dimension for terrestrial remote sensing and automatic image analysis was added with the
launch of Landsat-1 on July 23, 1972, and was further enhanced with the addition of Landsat-2 on
January 22, 1975. Both Landsats are in 920-km (572-mi) altitude Sun-synchronous polar orbits where they
complete a full observational cycle of the Earth between 81 degrees north and 81 degrees south latitude
every 9 days. The former 18-day repetitive Landsat-1 cycle was halved when Landsat-2 was launched into
a 180-degree diametrically opposite orbit. Both satellites carry a similar payload of multispectral sensors.

Landsat multispectral scanner (MSS) data are simultaneously sensed and recorded in four spectral bands:

• 0.5 to 0.6 µm (micrometer bandwidth of MSS band 4, representing visible green)

• 0.6 to 0.7 µm (MSS-5, representing visible red)

• 0.7 to 0.8 µm (MSS-6, representing solar infrared)

• 0.8 to 1.1 µm (MSS-7, representing solar infrared)

RBV data are recorded in three spectral bands as follows:

• 0.475 to 0.575 µm (visible blue-green interval)

• 0.580 to 0.680 µm (visible red interval)

• 0.690 to 0.830 µm (solar infrared interval)

[Text illegible in the source scan] ... orbit, and this [text illegible] scanned in each of the four MSS bands
for each mirror sweep with [text illegible] the along-track progression of the six scanning lines. Subsequently,
during NASA's [text illegible], this continuous-strip imagery is transformed into framed images with a
[text illegible] percent [overlap between consecutive] frames. As framed, each Landsat image covers a 185- by
185-km [area].

The analysis of [text illegible] digital MSS data. Little direct photointerpretation of the photographic
Landsat imagery was performed, except in a support role to the computer [text illegible] training-site selection
and general geographic reference.
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OVERLAYING LANDSAT IMAGES ON THE LANDSCAPE MODEL 

[Introductory paragraph illegible in the source scan.]

Review of Image Processing

[Text illegible in the source scan] may be defined as one or more transformations used to sample, calibrate,
restructure, [remainder of paragraph illegible in the source scan].

• Illumination (luminance, terrain features, and viewing geometry)

• Terrain (geometry, height profile, Earth curvature, and viewing perspective)

• Atmosphere (absorption, refraction, and background illuminance)

• Spacecraft motion (orbit rates, attitude, and attitude rates)

• Sensor optics (lens, aberrations, mean time before failure, and boresighting)

• Sensor electronics (nonlinear sweep, scene-induced distortions, and detector response)

• Spacecraft recording (signal-to-noise ratio (SNR), nonlinear tape dynamics, and timing signals)

• Data links (SNR, transmitter/receiver, modulation, and signal noise)

• Ground recording (SNR, nonlinear tape dynamics, and timing signals)

• Image processing (recording/playback, storage, and image transformation)

[Text illegible in the source scan] ... possible options and tend to sacrifice accuracy. Digital preprocessing,
on the other hand, has advantages in near-total flexibility and higher subsequent [text illegible in the source
scan] on conventional [text illegible in the source scan].



Image smoothing and image sharpening are two local image-enhancing operations. The former reduces the
high-frequency content of an image, thus eliminating noise. Conversely, the latter suppresses low frequencies
for edge or boundary enhancement. Digital, electronic analog, electro-optical, noncoherent optical, and
photographic media are all used to implement these operations in the spatial domain by convolving the
image-enhancing function with the untransformed scene. The frequency domain can also be used for spatial
filtering operations by using digital Fourier transforms to generate the image's frequency spectrum and then
reconstructing an image after selective suppression of undesired frequency spectrum components. These
concepts of image-enhancing operations can be generalized to any form of bandpass filtering to include
intermediate frequencies as well.
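
As an illustration of the spatial-domain case, the sketch below applies two small kernels to a toy image. These techniques were not used in the study; the kernels, function name, and pixel values are assumptions chosen for the example (a 3-by-3 mean filter for smoothing and a Laplacian-type high-pass filter for edge enhancement).

```python
def convolve3x3(image, kernel):
    """Apply a 3x3 kernel to interior pixels; border pixels are copied unchanged.

    Both kernels used below are symmetric, so this sliding weighted sum is
    identical to a true convolution.
    """
    rows, cols = len(image), len(image[0])
    out = [row[:] for row in image]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            out[r][c] = sum(
                kernel[i][j] * image[r + i - 1][c + j - 1]
                for i in range(3)
                for j in range(3)
            )
    return out


# Smoothing: local mean, which reduces high-frequency content (noise).
SMOOTH = [[1 / 9] * 3 for _ in range(3)]

# Sharpening: high-pass kernel whose weights sum to zero, so uniform
# (low-frequency) areas go to zero and only edges remain.
HIGH_PASS = [[0, -1, 0], [-1, 4, -1], [0, -1, 0]]

# Hypothetical 3-by-3 scene with one bright picture element in the centre.
spike = [[10, 10, 10], [10, 19, 10], [10, 10, 10]]
flat = [[10, 10, 10], [10, 10, 10], [10, 10, 10]]

smoothed = convolve3x3(spike, SMOOTH)
edges = convolve3x3(spike, HIGH_PASS)
flat_edges = convolve3x3(flat, HIGH_PASS)
```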

Image-restoring point operations seek to remove radiometric errors. Analog or digital corrections can be 
applied if the errors are systematic and the causative factors, such as luminance, terrain features, and viewing 
geometry are known (Reference 32). 

Although these preprocessing techniques are effective in reducing the effects of image radiometric and
degradation factors, they were not used because the principal thrust of this study was to test a specific hypothesis.
However, to the extent that they could potentially improve land-use classification results, these techniques 
should be kept in mind for future application. 

Preprocessing for Geometric Rectification 


The primary image preprocessing used in this study was geometric image transformation, whereby a set of
picture elements collected on a geometrically distorted grid were converted to samples on another precisely
rectified grid (figure 47). This method was necessary because the digital CCTs of the Landsat images were
not geometrically corrected, and their utility for mensuration, precise location, and spatial/temporal
registration was therefore limited. The correction of geometric errors was imperative because this study required
Landsat digital imagery that spatially matched ancillary landscape data planes derived from multivariate
maps of the Denver Metropolitan Area.

The following major sources of Landsat geometric errors have been cited (Reference 33):

• Rectangular picture element

• Spacecraft altitude variations

• Spacecraft attitude variations

• Earth rotation skew

• Orbital velocity changes

• Line-scanner time skew

• Nonlinear line-scanner sweep

• Line-scanner viewing geometry

• Non-north-oriented image.

FIGURE 47. GEOMETRIC RELATIONSHIP OF ORIGINAL AND NEAREST-NEIGHBOR TRANSFORMED LANDSAT
IMAGE CELLS. Separate symbols mark the original Landsat data input X-grid and the new transformed output
Y-grid. The new or output grid represents a counterclockwise rotation and rescaling of the original input
grid; ey is the total Euclidean error distance introduced by the nearest-neighbor resampling technique
(from Reference 34).

Resampling techniques for geometric image transformation include bilinear interpolation, cubic convolution,
nearest neighbor (NN), and various truncated versions of sin x/x. It has been shown (Reference 32) that the
cubic convolution function gave substantially higher-quality images for typically encountered scenes. Nonetheless,
for economy in the use of computer resources, it was decided to proceed with an NN resampling of the un- 
rectified digital scene. Little geometric error is introduced by using this process when the scene content 
changes slowly or when the interpolated value lies close to an initial X-grid point (figure 47). Further, errors 
induced by NN resampling should be regarded as residual geometric errors rather than as intensity errors. 
Lastly, initial efforts at developing an NN resampling computer program had already produced a rudimentary 
but workable and economical piece of software. 
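
The nearest-neighbor idea can be sketched as follows. This is not the study's resampling program: the function name, its parameters, and the choice of rotating about the grid origin are assumptions made for illustration. Each output cell is mapped back to a position on the input grid, and the value of the closest input cell is copied without interpolation.

```python
import math


def nn_resample(image, out_rows, out_cols, row_step, col_step, angle_deg=0.0, fill=0):
    """Nearest-neighbor resampling onto a rescaled, rotated output grid.

    row_step, col_step -- output cell spacing in units of input cells
    angle_deg          -- rotation of the output grid about the origin
    fill               -- value used where the output grid falls off the input
    """
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    in_rows, in_cols = len(image), len(image[0])
    out = []
    for r in range(out_rows):
        line = []
        for c in range(out_cols):
            y, x = r * row_step, c * col_step
            # Position of this output cell on the input grid.
            src_r = y * cos_t - x * sin_t
            src_c = y * sin_t + x * cos_t
            # Nearest input cell; no intensity interpolation is performed.
            i = int(math.floor(src_r + 0.5))
            j = int(math.floor(src_c + 0.5))
            inside = 0 <= i < in_rows and 0 <= j < in_cols
            line.append(image[i][j] if inside else fill)
        out.append(line)
    return out


# Hypothetical 4-by-4 input scene.
scene = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
    [13, 14, 15, 16],
]
half_size = nn_resample(scene, 2, 2, row_step=2.0, col_step=2.0)
unchanged = nn_resample(scene, 4, 4, row_step=1.0, col_step=1.0)
```

Because only whole input values are copied, any error shows up as a small positional offset of at most about half a cell in each direction, which is consistent with treating NN errors as residual geometric errors rather than intensity errors.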

The NN resampling program was improved to accommodate the full four-channel, 600- by 600-element image
needed to cover the Greater Denver Metropolitan Area. These alterations also provided for square resampling
units of any specified acreage or rectangular sampling units representing the correct geometry for both six
or eight vertical lines per inch displays at any given scale (figure 48). The improved software (ROTATE2)
operated in an “open-loop” geometric correction mode without feedback from any ground control points.
However, the initial rectification (figure 49) of three corrections (scale adjustment, Earth rotation skew, and
image rotation) was further improved with the addition of the nonlinear line-scanner sweep correction.

Landsat image height and width dimensions were adjusted to match pictorial detail on 1:24,000-scale rectified
[text illegible in the source scan] the corresponding control mosaic of four by four 7.5-minute USGS
topographic map sheets (figure 5). The rectilinear road network appeared dark on MSS-7 solar infrared
graymaps, and was the chief [text illegible in the source scan] and width parameters. Spot checks of the
geometric fit on the extensive network of lakes, reservoirs, and [text illegible in the source scan] distinctive
features. A spatial registration between the rotated MSS-7 graymaps and the base-map mosaic was finally
achieved, matching the desired number of picture elements to within one cell over the 39-km (24-mi) image
height and width.

Each of the four basic MSS bands was rectified for square 0.4-ha (1.111-acre) picture elements, yielding nine
[picture elements] equal to the 4-ha (10-acre) landscape data cell. This was done one band at a time, because a
somewhat larger scene was transformed to ensure that the desired study area was enclosed, and the entire
core capacity of the computer was occupied in the 600- by 600-[element] rectification. Hence, four MSS
single-channel files, 615 lines by 654 [text illegible in the source scan], [were combined into one] four-band
file, and then trimmed to yield the exact study area of 576 lines by 576 columns (figures 50 and 51).



EFFICIENCIES OF THE IMAGE GEOMETRIC TRANSFORMATION. The application of the nearest-neighbor
approach in resampling to a square grid transfers percentages of the samples shown from the input grid
(X-grid) to the output grid (Y-grid) as shown in figure 47.

(c) Rectified red image (MSS band 5, 0.6 to 0.7 µm). Display scale 1:173,600.
(d) Rectified solar IR image (MSS band 7, 0.8 to 1.1 µm). Display scale 1:173,600.

[...] AREA. The raw or unrectified Landsat picture elements, (a) and (b), are 192- by 259-foot inclined
rectangles [...] acres, or approximately 1.1 acres per picture element. The display scale of (c) and (d) is
nominally 1:173,600.

FIGURE 51. LANDSAT-1 MSS BAND 7 SOLAR INFRARED IMAGE OF THE DENVER METROPOLITAN AREA.
Display scale 1:250,000. [...] A three-by-three array of these image cells of 10 acres exactly overlaid the
square 10-acre cells of the landscape data planes.

Preprocessing to Form MSS-Band Ratios


Ratioing has been advocated as a means of effectively reducing random fluctuations of MSS reflectance
values caused by source variations and changing atmospheric conditions (References 34 and 35). Ratioing
is simply the division of the digital radiance value of one MSS band by that of another on a cell-to-cell basis.

A ratio of the near-infrared and chlorophyll absorption bands proved to be well correlated with the functioning
green biomass in grasslands (Reference 36). The ratio of MSS band 7 to band 5 enhanced the effect of biomass
changes (Reference 37). The main advantage of band ratios in vegetation classification is an improved
signal-to-noise ratio (Reference 37).


Twelve possible ratios can be computed for the four primary MSS bands taken two at a time; however, six
of these are merely the inverse of the other six. Because the inverse of a ratio carries the same spatial
variation as the ratio itself, only in an inverse sense, the inverse ratios provided no unique information
and were therefore omitted. Six ratios between the four original MSS bands were thus computed and
interspliced back into the multichannel image/landscape variable file using the LMS programs. Each picture
element in this file was represented by ten image values, one each for MSS bands 4, 5, 6, and 7 and for
ratios 5/4, 6/4, 7/4, 5/6, 7/5, and 7/6.
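
The ratio step described above can be illustrated with a minimal sketch. It is not the LMS program itself; the function name `band_ratios`, the `eps` guard against division by zero, and the synthetic 6-by-6 radiance values are assumptions made only for illustration.

```python
import numpy as np

# (numerator, denominator) band pairs, in the order listed in the text.
RATIO_PAIRS = [(5, 4), (6, 4), (7, 4), (5, 6), (7, 5), (7, 6)]

def band_ratios(bands, eps=1e-6):
    """Cell-by-cell ratios; `bands` maps MSS band number to a 2-D float array."""
    # `eps` (an assumption of this sketch) guards against division by zero.
    return {(n, d): bands[n] / np.maximum(bands[d], eps) for n, d in RATIO_PAIRS}

# Synthetic 6-by-6 radiance values stand in for real MSS data.
rng = np.random.default_rng(0)
bands = {b: rng.integers(1, 128, size=(6, 6)).astype(float) for b in (4, 5, 6, 7)}
ratios = band_ratios(bands)

# Ten image values per picture element: four bands, then six ratios.
stack = np.stack([bands[b] for b in (4, 5, 6, 7)] + [ratios[p] for p in RATIO_PAIRS])
print(stack.shape)  # (10, 6, 6)
```

Only six of the twelve ordered band pairs are kept because each omitted ratio is the reciprocal of one already present.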


Merging of Landsat and Landscape Data Planes 

Digital Landsat imagery was geometrically corrected and resampled to overlay the landscape model with a
square cell resolution of 0.4 ha (1.111 acre) that nested a three-by-three array of Landsat cells exactly into
4 ha (10 acres). It was next necessary to transform each of the 36,864 data cells in each of the 34 landscape
planes, the assembly of which was described in Chapter 2, into a three-by-three array of 0.4-ha (1.111-acre)
cells with the original 4-ha (10-acre) value duplicated in all nine cells. This zoom-like transformation
preserved the spatial registration because the Landsat image data planes were three times larger in
height/width elements than the landscape data planes: 576 picture elements to 192 landscape cells. This
expansion function was performed on the 34 landscape variables with an auxiliary program (ANCILLARY). The
resultant expanded or zoomed landscape planes appeared to be somewhat blocky in comparison to their
original form (figure 52), but they were now in perfect one-to-one registration with the Landsat imagery
at the 0.4-ha (1.111-acre) resolution representing 331,776 cells. Subsequently, the 10-variable Landsat
imagery and the zoomed or expanded 34-variable landscape data were merged into a composite 44-variable
data file (figure 53). These 44 variables were embedded in a standardized packed binary format of the
landscape-modeling/image-processing system for analysis.
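
The three-by-three expansion and merge can be sketched as follows. This is an illustrative stand-in for the ANCILLARY program, not a reproduction of it; the function name `expand_3x3`, the `uint8` storage type, and the placeholder data are assumptions.

```python
import numpy as np

def expand_3x3(planes):
    """Duplicate every cell into a 3-by-3 block: (n, r, c) -> (n, 3r, 3c)."""
    return np.repeat(np.repeat(planes, 3, axis=1), 3, axis=2)

# 34 landscape planes of 192 by 192 ten-acre cells (placeholder values).
landscape = np.zeros((34, 192, 192), dtype=np.uint8)
landscape[0] = np.arange(192 * 192).reshape(192, 192) % 251

zoomed = expand_3x3(landscape)                    # 34 planes of 576 by 576
image = np.zeros((10, 576, 576), dtype=np.uint8)  # 4 bands + 6 ratios (placeholder)

# Composite file: image variables first, then the expanded landscape planes.
merged = np.concatenate([image, zoomed], axis=0)
print(merged.shape, zoomed[0].size)  # (44, 576, 576) 331776
```

Each original cell value appears nine times in the expanded plane, which is why the zoomed planes look blocky while staying in one-to-one registration with the image.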


Preprocessing to Remove Terrain Effects 


Normalizing is proposed as a preprocessing technique for specifically removing terrain effects. Normalizing
is simply the division of the digital radiance of an MSS band by the computed Landsat overflight insolation
on a cell-to-cell basis. Although normalizing is mathematically similar to ratioing, the denominator is an
ancillary variable, and the resultant quotient is also an ancillary landscape variable.


Identical surface-cover materials may have been imaged at different radiance values in a given MSS band
because they occurred on varying terrain (i.e., differing slope, aspect, or topographically shaded areas).


(c) Three-by-three expanded slope plane (each cell = 1.111 acre). Display scale 1:173,600.
(d) Three-by-three expanded insolation plane (each cell = 1.111 acre). Display scale 1:173,600.

FIGURE 52. DISPLAY OF THE EFFECT OF THE EXPANSION OF THE ANCILLARY MAP DATA PLANES
IN THE LANDSCAPE MODEL FOR OVERLAY ON LANDSAT IMAGE DATA PLANES. Nonimage map data
cells representing 10 acres each were duplicated in a three-by-three picture-element array to exactly spatially
overlay the higher-resolution (1.111-acre) rectified/resampled Landsat picture elements (figure 60). Unexpanded
ancillary data displays, (a) and (b), scale 1:500,000. Expanded ancillary data displays, (c) and (d), show the
expanded center one-ninth of (a) and (b) at a nominal scale of 1:173,600.

(c) Three-by-three expanded slope ancillary data plane.
(d) Three-by-three expanded insolation ancillary data plane.

FIGURE 53. DISPLAY ILLUSTRATING THE OVERLAY OF SELECTED PHYSIOGRAPHIC MAP DATA ON
THE LANDSAT MULTISPECTRAL IMAGERY OF THE DENVER METROPOLITAN AREA. Thirty-eight
ancillary land-use, physiographic, socioeconomic, transportation, and four MSS/insolation ratio data planes
were thus combined with the basic four Landsat multispectral channels and six image transformations (i.e.,
channel ratios) into a composite 48-channel Landsat image/ancillary data file for automated image processing.

If those illumination effects are proportionally constant for both the MSS band and insolation value, the
ratio of the two parameters will negate the terrain effect because it appears in both the numerator and
denominator.


The LMS programs (Appendix B) were used to create (TRANSF2) four MSS/insolation normalized bands
and to intersplice (COMBINE) them back into the composite image/landscape data file as the last four
ancillary landscape variables. Therefore, the complete 48-variable data file was composed of the four original
MSS bands, six MSS band ratios, and 38 land-use, physiographic, socioeconomic, transportation access, and
MSS insolation normalized ancillary landscape variables (figure 53).
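
A small numerical sketch may help show why dividing by insolation cancels a proportional terrain effect. It is not the TRANSF2 program; the function name, the `eps` guard, and all numbers are invented for illustration.

```python
import numpy as np

def normalize_by_insolation(band, insolation, eps=1e-6):
    """Divide an MSS band by the insolation plane, cell by cell."""
    # `eps` (an assumption of this sketch) avoids division by zero in shadow.
    return band / np.maximum(insolation, eps)

# One surface material seen under varying illumination (hypothetical values).
illumination = np.array([[0.4, 0.7], [1.0, 0.55]])  # relative terrain factor
band = 60.0 * illumination         # radiance scales with illumination
insolation = 200.0 * illumination  # computed insolation scales the same way

normalized = normalize_by_insolation(band, insolation)
print(normalized)  # every cell is approximately 0.3
```

The cancellation holds only under the stated condition that the illumination effect is proportionally the same in the band and in the insolation value.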


MULTIVARIATE CLASSIFICATION WITH RECTANGULAR TRAINING SETS

Feature selection with multispectral scanner data commonly uses training sets. A training set is a subsample
whose identification is known to an acceptable level of accuracy. The training set is used to generate
subpopulation statistics for efficiently implementing decision rules in the following classification phase.
Overall utility of the feature extraction process is contingent on the quality of samples selected to serve as
a training set (i.e., sample size, spatial distribution, and efficiency). The basic problem is the identification
and selection of samples with the correct probability distributions in n-dimensional space, where n is the
number of variables of the multispectral imagery. These probability distributions are then used to construct
decision rules that can be used to represent the prediction sample set.

A number of data displays are used with MSS data to facilitate feature selection, such as histograms for each
class and data plane or spectral overlay, as well as correlation and covariance matrices between classes.
Parametric discriminants, such as Bayesian probabilities or maximum likelihood functions, are usually used
with training-set data to develop the classification algorithms. Here, linear-discriminant analysis was used as
in Chapter 3 to reduce a multivariate problem to a univariate situation.
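
As a rough illustration of how a linear discriminant reduces many variables to one score per class, the sketch below fits a pooled-covariance discriminant with equal priors to synthetic data. It is a generic textbook form, not the stepwise program used in the study; the function names and the data are assumptions.

```python
import numpy as np

def fit_lda(X, y):
    """Equal-prior linear discriminant with a pooled within-class covariance."""
    classes = np.unique(y)
    means = np.array([X[y == c].mean(axis=0) for c in classes])
    centered = X - means[np.searchsorted(classes, y)]
    pooled = centered.T @ centered / (len(X) - len(classes))
    weights = means @ np.linalg.pinv(pooled)          # one weight vector per class
    offsets = -0.5 * np.einsum("ij,ij->i", weights, means)
    return classes, weights, offsets

def predict_lda(model, X):
    """Assign each sample to the class with the largest linear score."""
    classes, weights, offsets = model
    return classes[np.argmax(X @ weights.T + offsets, axis=1)]

# Synthetic training set: three well-separated classes, two variables.
rng = np.random.default_rng(1)
centers = np.array([[0.0, 0.0], [5.0, 5.0], [0.0, 6.0]])
y = np.repeat(np.arange(3), 50)
X = centers[y] + rng.normal(size=(150, 2))

model = fit_lda(X, y)
accuracy = np.mean(predict_lda(model, X) == y)
print(accuracy)
```

Each decision costs one dot product per class, which is the source of the speed advantage over quadratic rules.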

Others have tested many alternative procedures because of the great amount of computational time required
for the maximum-likelihood ratio. For example, it is possible to reduce the number of data through sampling
and to save time with the decision rule itself (Reference 38). The decision rule commonly used is founded
on two assumptions: (1) that the data are Gaussian or normally distributed; and (2) that the training data for
each class adequately represent the entire class. Under these assumptions, the maximum-likelihood decision
rule becomes a quadratic rule. This rule requires many multiplications for each decision, especially when
many channels of data and many desired identification classes are used. The linear-decision rule also compared
favorably in accuracy and was 50 times faster than the maximum-likelihood function (Reference 38).
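
The quadratic form of the Gaussian maximum-likelihood rule can be sketched as below. This is a generic illustration under the two assumptions just listed, not the classifier of Reference 38; function names and synthetic data are assumptions.

```python
import numpy as np

def fit_gaussian_ml(X, y):
    """Per-class mean, inverse covariance, and log-determinant."""
    model = {}
    for c in np.unique(y):
        Xc = X[y == c]
        cov = np.cov(Xc, rowvar=False)
        model[c] = (Xc.mean(axis=0), np.linalg.inv(cov), np.linalg.slogdet(cov)[1])
    return model

def predict_gaussian_ml(model, X):
    """Pick the class with the largest Gaussian log-likelihood (equal priors)."""
    classes = sorted(model)
    scores = []
    for c in classes:
        mean, inv_cov, logdet = model[c]
        d = X - mean
        # Quadratic form d' S^-1 d: on the order of n*n multiplications per class.
        maha = np.einsum("ij,jk,ik->i", d, inv_cov, d)
        scores.append(-0.5 * (maha + logdet))
    return np.array(classes)[np.argmax(np.array(scores), axis=0)]

rng = np.random.default_rng(4)
centers = np.array([[0.0, 0.0], [5.0, 5.0], [0.0, 6.0]])
y = np.repeat(np.arange(3), 50)
X = centers[y] + rng.normal(size=(150, 2))

ml_model = fit_gaussian_ml(X, y)
ml_accuracy = np.mean(predict_gaussian_ml(ml_model, X) == y)
print(ml_accuracy)
```

With n channels, the quadratic form grows roughly with n squared per class and per decision, whereas a linear rule grows with n.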

Very fast nonparametric procedures have been developed (Reference 39). These linear-discriminant functions
achieved satisfactory results with multiclass/multispectral data as long as the order of class separability was
observed during the classification process. Their algorithm took 5 minutes for one area, as compared to 70
minutes for six channels and 5 hours for 12 channels with the maximum-likelihood algorithm. Economies
like these will be important in making the computer more available (and acceptable) for future machine
processing of multispectral imagery.

Other nonparametric classification studies that either cited increased speed or satisfactory predictions used
a composite sequential clustering algorithm (Reference 40), an elliptical boundary condition model (Reference
41), and a lookup table procedure that was 30 times faster than the maximum-likelihood ratio and generally
as accurate (Reference 42).


Feature-extraction activities in this study included careful selection of 72 rectangular training sets
totaling 2,413 picture elements, or three rectangles each to represent the 24 second- and third-order USGS
land-use classes to be mapped. Geometrically rectified and resampled MSS-5 and -7 graymaps overlaid on
the 1:24,000-scale topographic/land-use maps facilitated the entire process of selection and identification
needed for the recognition and mapping of the 24 land-use classes.

The initial or baseline rectangular training-set classification used only the original four MSS channels and
no other information to correctly reclassify 65.2 percent of the training sample back into the correct
classes (table 13). This percentage (FOM) will be referred to hereafter as training-set accuracy. An overall FOM
is the total number of correctly classified cells divided by the total number of cells, expressed as a
percentage (Reference 34). A second baseline classification used the original four MSS bands and six
transformed image channels consisting of the ratios of the basic four bands to yield an improvement of
2.1 percentage points to an overall average training-set accuracy of 67.3 percent (table 14).

Next, the 31 landscape data planes, exclusive of any categorical-type land-use data planes, were included with
the ten image variables, raising the accuracy of the correct training-set identification to 99.7 percent for the
24 classes (table 15). Therefore, 99.7 percent of the 2,413 test image cells could be correctly assigned to their
known land-use category. The random assignment of these cells to the 24 land-use categories would have
yielded 1/n = 1/24 classes times 100 percent, or approximately 4.2-percent accuracy. Thus, this represented a
substantial gain in the accuracy of automatically interpreted Landsat imagery with the symbiotic inclusion of
spatially registered landscape variables. Only 40 variables were included in the final model because total
year-round housing units failed the stepwise tolerance test for inclusion in the function at step 23.

TABLE 13

The Landsat variables were added in a free stepwise fashion and were classified using linear-discriminant
analysis.

Columns: step number; Landsat variable entered (free); training-set classification, total points correct and
correct points (percent); C-P* time expended (seconds); correct points per C-P second expended, step and
average; F-value to enter.

Step  Variable entered           Points     (%)   Time (s)     Step   Average   F to enter
1     MSS-7 (solar infrared)      1,208   50.06      8.74    138.22    138.22     1,640.13
2     MSS-5 (visible red)         1,518   62.91      9.68    156.82    147.99       562.87
3     MSS-4 (visible green)       1,585   65.69     10.48    151.24    149.17       115.96
4     MSS-6 (solar IR)            1,573   65.19     24.78     63.48    109.61       104.07

*C-P = central processor.

TABLE 14

TRAINING-SET ACCURACY OF AUTOMATED INTERPRETATION OF THE FOUR ORIGINAL LANDSAT MSS BANDS
AND THEIR SIX RATIOS OF A SINGLE-DATE LANDSAT IMAGE (August 15, 1973) OF THE DENVER METROPOLITAN
AREA. The Landsat variables were added in a free stepwise fashion and were classified using
linear-discriminant analysis.

Columns: step number; Landsat variable entered (free); training-set classification, total points correct and
correct points (percent); C-P* time expended (seconds); correct points per C-P second expended, step and
average; F-value to enter.

Step  Variable entered           Points     (%)   Time (s)     Step   Average   F to enter
1     MSS-7 (solar infrared)      1,208   50.06     11.49    105.13    105.13     1,640.13
2     MSS-5/MSS-4 ratio           1,479   61.29     12.12    121.93    113.76       573.23
3     MSS-7/MSS-6 ratio           1,522   63.08     12.83    118.63    115.47       227.78
4     MSS-5 (visible red)         1,552   64.32     13.63    113.87    115.04        93.89
5     MSS-4 (visible green)       1,607   66.60     14.42    111.44    114.23       192.63
6     MSS-6/MSS-4 ratio           1,621   67.18     15.30    105.95    112.64       108.88
7     MSS-7/MSS-4 ratio           1,623   67.26     16.22    100.06    110.19        54.06
8     MSS-5/MSS-6 ratio           1,615   66.93     16.93     95.39    108.25        28.90
9     MSS-6 (solar infrared)      1,624   67.30     17.45     93.07    106.22        10.15
10    MSS-7/MSS-5 ratio           1,624   67.30     34.30     47.35     93.96         6.05

*C-P = central processor.





The stepwise linear-discriminant analysis algorithm automatically added each landscape variable in the order
in which it added the most to the land-use classification accuracy achieved at that step. Clearly, the addition
of many of the less sensitive landscape variables did not measurably increase the final accuracy achieved, but
significantly increased the total cost and decreased the cost-efficiency of the test classification (figure 54).
These accuracy and cost-performance values were used as the basis for selecting three optimal MSS spectral
bands and four landscape variables from the 41 initial variables for a final test of a more efficient and economic
procedure. These seven optimal variables served to correctly assign all 24 land uses with an average
training-set accuracy of 96.6 percent (table 16), as compared with the 99.7-percent accuracy achieved with
40 variables at almost two times the cost in central-processor time.


The utility of the landscape data planes overlaid on the Landsat spectral data base was evident in all
classification tests. However, a potential time bias problem was inherent in most of the ancillary data planes.
Specifically, it was felt that the time-related cultural data (i.e., 1970 Census data) might unduly influence the
discriminant functions toward the historical date represented by these landscape variables, particularly in
view of their high F-values. Therefore, it was considered appropriate to statistically force in the current (1973)
Landsat imagery variables in their free order before the totally free stepwise entry of ancillary variables to
reduce the ancillary data time bias and to generate a more representative land-use map.
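
The forced-then-free entry order can be sketched as a greedy selection loop. This is only an illustration: it ranks free variables by resubstitution accuracy of a pooled-covariance linear discriminant, whereas the study's program used F-to-enter values and a tolerance test. All names and the synthetic data are assumptions.

```python
import numpy as np

def lda_accuracy(X, y):
    """Resubstitution accuracy of an equal-prior linear discriminant."""
    classes, idx = np.unique(y, return_inverse=True)
    means = np.array([X[idx == k].mean(axis=0) for k in range(len(classes))])
    centered = X - means[idx]
    pooled = np.atleast_2d(centered.T @ centered / (len(X) - len(classes)))
    w = means @ np.linalg.pinv(pooled)
    scores = X @ w.T - 0.5 * np.sum(w * means, axis=1)
    return float(np.mean(np.argmax(scores, axis=1) == idx))

def stepwise(X, y, forced):
    """Enter `forced` columns in the given order, then the rest greedily."""
    order, history = [], []
    for j in forced:
        order.append(j)
        history.append(lda_accuracy(X[:, order], y))
    free = [j for j in range(X.shape[1]) if j not in forced]
    while free:
        best = max(free, key=lambda j: lda_accuracy(X[:, order + [j]], y))
        free.remove(best)
        order.append(best)
        history.append(lda_accuracy(X[:, order], y))
    return order, history

# Synthetic data: column 0 weakly informative, column 2 strongly informative.
rng = np.random.default_rng(2)
y = np.repeat(np.arange(3), 100)
X = rng.normal(size=(300, 5))
X[:, 0] += 0.5 * y
X[:, 2] += 5.0 * y

order, history = stepwise(X, y, forced=[0, 1])
print(order[:3], [round(h, 2) for h in history[:3]])
```

Forcing the image variables first means an ancillary variable is credited only with what it adds beyond the current imagery.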


TABLE 15

IMPROVEMENT IN THE TRAINING-SET ACCURACY OF AUTOMATED INTERPRETATION OF A SINGLE-DATE
LANDSAT-1 IMAGE OF THE DENVER METROPOLITAN AREA BY ADDING [...]. The Landsat variables were forced
in the predetermined order (table 14), and the landscape variables were added in a free stepwise fashion and
were classified using linear-discriminant analysis.

Columns: step number; variable entered; training-set classification, total points correct and correct points
(percent); C-P* time expended (seconds); correct points per C-P second expended, step and average;
F-value to enter.

Step  Variable entered                                      Points     (%)   Time (s)    Step   Average   F to enter

      Landsat (forced)
1     MSS-7 (solar infrared)                                 1,208   50.06     24.22    49.88     49.88     1,640.13
2     MSS-5/MSS-4 ratio                                      1,479   61.29     24.98    59.21     54.61       573.23
3     MSS-7/MSS-6 ratio                                      1,522   63.08     25.76    59.08     56.15       227.78
4     MSS-5 (visible red)                                    1,552   64.32     26.51    58.54     56.78        93.89
5     MSS-4 (visible green)                                  1,607   66.60     27.27    58.93     57.23       192.63
6     MSS-6/MSS-4 ratio                                      1,621   67.18     28.01    57.87     57.35       108.88
7     MSS-7/MSS-4 ratio                                      1,623   67.26     29.00    55.97     57.13        54.06
8     MSS-5/MSS-6 ratio                                      1,615   66.93     29.82    54.16     56.72        28.90
9     MSS-6 (solar infrared)                                 1,624   67.30     30.36    53.49     56.32        10.15
10    MSS-7/MSS-5 ratio                                      1,624   67.30     31.14    52.15     55.85         6.05

      Landscape (free)
11    Total census-tract acreage                             2,036   84.38     31.72    64.19     56.71   133,372.71
12    Built-up urban area MD†                                2,162   89.60     32.49    66.54     57.64     1,294.12
13    Topographic elevation                                  2,278   94.41     33.17    68.68     58.62     1,060.76
14    Freeway interchange MD                                 2,343   97.10     34.00    68.91     59.48       349.27
15    Freeway MD                                             2,353   97.51     34.73    67.75     60.13       626.06
16    Median housing-unit rent                               2,349   97.35     35.51    66.15     60.57       349.55
17    Composite major-road MD                                2,353   97.51     36.27    64.87     60.88       221.08
18    Median housing-unit value                              2,371   98.26     36.98    64.12     61.09       224.90
19    Total vacant housing units                             2,380   98.63     37.75    63.05     61.22       182.63
20    Average number of cars per family                      2,376   98.47     38.56    61.62     61.24       119.81
21    Average number of vacant housing units per acre        2,387   98.92     39.33    60.69     61.21       110.39
22    Population density per acre                            2,382   98.72     39.86    59.76     61.13        97.27
23    Total population                                       2,383   98.76     40.61    58.68     61.00        96.18
24    1969 mean family income                                2,379   98.59     41.34    57.55     60.82       163.80
25    Total two-car families                                 2,384   98.80     42.08    56.65     60.60       135.20
26    Total three-car families                               2,390   99.05     42.77    55.88     60.37       102.20
27    Total year-round housing units                         2,396   99.30     45.73    52.39     59.98       105.41
28    Average number of year-round housing units per acre    2,403   99.59     45.17    53.20     59.66        97.37
29    Total families                                         2,398   99.38     45.10    53.17     59.37       102.18
30    Total one-car families                                 2,395   99.25     45.77    52.33     59.06       142.24
31    Average number of families per acre                    2,395   99.25     46.84    51.13     58.73        91.83
32    Landsat-1 image insolation                             2,399   99.42     47.55    50.45     58.39        93.71
33    Topographic slope                                      2,405   99.67     48.10    50.00     58.05       100.84
34    Surficial geology                                      2,405   99.67     48.69    49.39     57.71        38.35
35    Composite minor-road MD                                2,405   99.67     49.60    48.49     57.36        35.28
36    Landsat-1 MSS-5/insolation ratio                       2,405   99.67     50.31    47.80     57.00         9.45
37    Landsat-1 MSS-4/insolation ratio                       2,405   99.67     51.01    47.15     56.64        24.72
38    Landsat-1 MSS-7/insolation ratio                       2,405   99.67     51.77    46.46     56.28         8.36
39    Topographic aspect                                     2,405   99.67     52.54    45.77     55.91         7.43
40    Landsat-1 MSS-6/insolation ratio                       2,405   99.67     53.35    45.08     55.54         3.96

*C-P = central processor.
†MD = minimum distance.

TABLE 16

HIGH TRAINING-SET ACCURACIES AND ECONOMY ACHIEVED BY AUTOMATED INTERPRETATION OF A
SELECTION OF THREE MSS BANDS OF A SINGLE-DATE LANDSAT IMAGE (August 15, 1973) OF THE DENVER
METROPOLITAN AREA WITH OVERLAYS OF FOUR LANDSCAPE VARIABLES. This table is based on identifying
24 land uses (table 1) with three rectangular training sets per land use that also provided the test sample of
2,413 picture elements. The Landsat variables were forced in the predetermined order (table 13), and the
ancillary landscape variables were added in a free stepwise fashion and were classified using
linear-discriminant analysis.

Columns: step number; variable entered; training-set classification, total points correct and correct points
(percent); C-P* time expended (seconds); correct points per C-P second expended, step and average;
F-value to enter.

Step  Variable entered             Points     (%)   Time (s)     Step   Average   F to enter

      Landsat (forced)
1     MSS-7 (solar infrared)        1,208   50.06     10.17    118.78    118.78     1,640.13
2     MSS-5 (visible red)           1,518   62.91     11.04    137.50    128.52       562.87
3     MSS-4 (visible green)         1,585   65.69     11.70    135.47    130.99       115.96

      Landscape (free)
4     Total census-tract acreage    1,992   82.55     12.32    161.69    132.35   113,029.23
5     Built-up urban area MD†       2,143   88.81     13.19    162.47    146.96     1,331.26
6     Topographic elevation         2,242   92.91     13.77    162.82    149.98     1,007.72
7     Freeway interchange MD        2,332   96.64     27.57     84.58    131.91       356.92

*C-P = central processor.
†MD = minimum distance.


The term “training-set accuracy” is used deliberately: these classification tests indicated only the
rectangular training-set accuracy for the use of Landsat and landscape or ancillary data variables for mapping
land use. Further testing was done to determine how accurately these feature extraction procedures extended
to the mapping of the entire Denver study site, that is, the “verification” accuracy. The verification test was
performed by classifying the 36,864 cells of a larger one-ninth sample image with the discriminant function
developed from the 72 rectangular training sets for each variable added in the stepwise fashion previously
determined (table 15). This one-ninth sample image was generated by rectilinearly sampling every third row
and third column for a total sample of 192 rows and 192 columns from the full 576- by 576-picture element
composite image. The accuracy of each successive map produced by this function in the step-by-step
fashion was then verified on a cell-by-cell basis by comparison with the known 1972-1973 USGS land-use
map stored in the landscape model (figure 55). The cost of computing and comparing a total classification
map at each step prevented the continuation of this process for all 40 steps. The average image-verification
accuracy was a surprisingly low 24.1 percent at the addition of the 28th step (figure 56 and table 17), as
compared to the predicted training-set accuracy of 99.6 percent at the same step (table 15). The rectangular
training sets, carefully selected to statistically represent the 24 land-use classes by 2,413 points, were clearly
inadequate samples to represent the total sampled image of 36,864 points. Apparently, a fortuitous choice
of training-set samples gave extremely high training-set accuracies in this restricted pattern space. However,
the training sets were statistically insufficient descriptors for classifying the larger image sample and, for that
matter, the full image, because the one-ninth sample image statistically represented the full image.

FIGURE 56. VERIFICATION OF THE MAP ACCURACY FOR THE DENVER METROPOLITAN AREA FOR ALL
1972-1973 USGS LAND-USE CLASSES (August 16, 1973 Landsat-1 image). A point was plotted in black if the
individual cell was correctly classified when checked against the corresponding cell on the digital 1972-1973
USGS land-use reference map. The linear-discriminant algorithm developed from the rectangular training
sets was used to classify this one-ninth rectilinearly sampled image. It used the first 20 variables and
correctly classified 9,740 points into 24 second- and third-order classes for a 26.4-percent accuracy.
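
The one-ninth sampling and the cell-by-cell verification check can be sketched as follows. The function names and the synthetic maps are assumptions; the count of 8,870 correct cells is taken from step 28 of table 17 purely to exercise the arithmetic.

```python
import numpy as np

def one_ninth_sample(image):
    """Keep every third row and every third column (one cell in nine)."""
    return image[..., ::3, ::3]

def verification_accuracy(classified, reference):
    """Percentage of cells whose class matches the reference map."""
    return 100.0 * np.mean(classified == reference)

rng = np.random.default_rng(3)
reference_full = rng.integers(0, 24, size=(576, 576))  # stand-in land-use map
reference = one_ninth_sample(reference_full)
print(reference.shape, reference.size)  # (192, 192) 36864

# Hypothetical classified map: wrong everywhere except the first 8,870 cells.
classified = (reference + 1) % 24
classified.flat[:8870] = reference.flat[:8870]
print(round(verification_accuracy(classified, reference), 2))  # 24.06
```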


step order. A total of 1,554 points were correctly classified after the four steps, yielding an FOM,* or
overall accuracy, of 37.9 percent (figure 57 and table 18). Since the one-ninth by one-ninth rectilinear-point
training-set sample of 4,100 points constituted a statistical sample of the points in the final map, this accuracy
closely represented the final map accuracy that might be expected. It therefore directly represented map or
verification accuracy in lieu of the training-set accuracy usually achieved by this type of activity. The
representative nature of the verification accuracy achieved here with the 1/81-sample relative to the final,
total accuracy achieved by using the same discriminant function will subsequently be clearly established.
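
A systematic grid sample of this kind can be sketched by taking the center cell of each nine-by-nine block. The function name and the synthetic land-use plane are assumptions. For a 576 by 576 plane this simple rule yields 4,096 points, close to but not exactly the 4,100 points reported, so the original sampling evidently differed in some detail.

```python
import numpy as np

def grid_sample_centers(plane, block=9):
    """Return the center cell of every block-by-block array of cells."""
    c = block // 2
    return plane[c::block, c::block]

rng = np.random.default_rng(5)
land_use = rng.integers(0, 24, size=(576, 576))  # stand-in for the USGS plane

training_labels = grid_sample_centers(land_use)
print(training_labels.shape, training_labels.size)  # (64, 64) 4096

# Group the sampled points by land-use code, as the training classes require.
codes, counts = np.unique(training_labels, return_counts=True)
```

Because the points are spread evenly over the scene, each class is represented roughly in proportion to its area, unlike hand-picked rectangles.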

The second test made use of the ten-band combination of four original MSS bands and six transformation
ratios and achieved a slightly improved 38.4-percent verification accuracy. Again, the six MSS ratios
contributed little to the improvement of the classification (figure 58 and table 19).

The full array of landscape variables, less the land-use data planes, was next included with the ten image
bands. Verification accuracy was raised 15.5 percentage points, from 38.4 to 53.9 percent, by using these 31
nonland-use landscape variables to classify the 24 land uses sought. Although not as spectacular as the
increase in training-set accuracy achieved when using the rectangular training sets, the 53.9-percent
verification accuracy was realistically representative of larger-scale image-classification results (figure 59
and table 20).


*The FOM, the average classification accuracy, is defined as the total number of correctly classified cells, divided by the total number of cells
times 100 to obtain a percentage value.

TABLE 17

VERIFIED LAND-USE ACCURACY FOR AUTOMATED INTERPRETATION USING RECTANGULAR TRAINING
SETS. [...] training sets of each land use representing a 2,413-point sample. It was applied in the same
stepwise fashion as previously determined (table 15) to classify a rectilinearly sampled one-ninth image of
36,864 points. This sample image-classification accuracy was only 24.1-percent correct, based on a
point-to-point comparison to the 1972-1973 USGS [...]. [...] manual training-set selection was clearly evident
in this case. The variable numbers coincide with [...].

Columns: step number; variable entered; verification classification, total points correct and correct points
(percent); C-P* time expended (seconds); correct points per C-P second expended, step and average;
F-value to enter.

Step  Variable entered                                      Points     (%)   Time (s)    Step   Average   F to enter

      Landsat (forced)
2     MSS-5/MSS-4 ratio                                     11,185   30.34    378.81    29.53     29.53       573.23
4     MSS-5 (visible red)                                   10,708   29.05    403.74    26.52     27.98        93.89
6     MSS-6/MSS-4 ratio                                     10,598   28.75    426.92    24.82     26.86       108.88
8     MSS-5/MSS-6 ratio                                     10,365   28.12    461.85    22.44     25.64        28.90
10    MSS-7/MSS-5 ratio                                     10,369   28.13    478.69    21.66     24.76         6.05

      Landscape (free)
12    Built-up urban area MD†                               12,075   32.76    499.91    24.15     24.64     1,294.12
14    Freeway interchange MD                                11,425   30.99    527.16    21.67     24.15       349.27
16    Median housing-unit rent                              11,445   31.05    554.67    20.63     23.63       349.55
18    Median housing-unit value                             10,417   28.26    580.60    17.94     22.86       224.90
20    Average number of cars per family                      9,740   26.42    599.09    16.26     22.06       119.81
24    1969 mean family income                                9,162   24.85    638.01    14.36     21.17       163.80
28    Average number of year-round housing units per acre    8,870   24.06    682.75    12.99     20.28        97.37

*C-P = central processor.
†MD = minimum distance.


The discriminant function for each step was saved from the 41-variable discriminant analysis of the 4,100
grid-sampled point-training set for verification testing on the one-ninth sampled image. The classification
results and resultant step-by-step display maps confirmed the utility of the 1/81-grid-sampled training set
as a valid and representative sampling of the larger image and illustrated a successful display product of the
new systematic, point-feature extraction process. The 52.9-percent verification accuracy for this one-ninth
[...]

FIGURE 57 (caption fragment; horizontal axis: CLASSIFICATION VARIABLES). [...] stepwise fashion. The
variable numbers coincide with the step numbers in table 18.

TABLE 18

[...] FOUR ORIGINAL MSS BANDS OF A [...]. This table is based on identifying 24 land uses (table 1) with a
test sample of 4,100 picture elements. These elements were systematically sampled from the center of each
nine-by-nine array of picture elements and identified and grouped by the land-use codes specified by the
1972-1973 USGS land-use data plane. The Landsat-1 image variables were added in a free stepwise fashion
and classified using linear-discriminant analysis.

Columns: step number; Landsat variable entered (free); training-set classification, total points correct and
correct points (percent); C-P* time expended (seconds); correct points per C-P second expended, step and
average; F-value to enter.

Step  Variable entered           Points     (%)   Time (s)     Step   Average   F to enter
1     MSS-7 (solar infrared)      1,290   31.46     14.79     87.22     87.22        68.50
2     MSS-4 (visible green)       1,356   33.07     16.32     83.09     85.05        33.21
3     MSS-5 (visible red)         1,554   37.90     18.54     83.82     84.59        35.41
4     MSS-6 (solar infrared)      1,554   37.90     19.95     77.89     82.67         1.23

*C-P = central processor.





Each successive map produced in this stepwise fashion was verified on a point-to-point basis (36,864
points) [...]. The accuracy contributed by the stepwise-added variables (table 21), together with its spatial
distribution, was assessed in this fashion relative to the 1972-1973 USGS reference theme maps (figures 61
and 62).

Finally, two additional verified classification maps were displayed for the last or 41st step of the one-ninth
sample image. The average composite second- and third-order verified accuracy was 52.9 percent for
the 19,482 correctly classified points (figure 63a). The average first-order verified accuracy was 78.3
percent for the 28,871 correctly classified points (figure 63b) out of a population of 36,864 picture elements.

MAXIMUM-LIKELIHOOD CLASSIFICATION WITH GRID-SAMPLED TRAINING SETS

[...] the maximum-likelihood ratio and the linear-discriminant function, respectively.

FIGURE 58. VERIFICATION OF ACCURACIES AND COSTS OF PROCESSING SINGLE-DATE (August 16,
1973) LANDSAT IMAGERY OF THE DENVER METROPOLITAN AREA WITH THE FOUR PRIMARY AND SIX
MSS-BAND RATIOS. This figure is based on identifying the 24 land uses (table 1) with 4,100 grid-sampled picture
elements and the 1972-1973 USGS land-use data plane. The Landsat variables were added in a free stepwise fashion.
The variable numbers coincide with the step numbers and MSS-band/ratio identities in table 19.

TABLE 19

[...] INTERPRETATION OF THE FOUR ORIGINAL MSS BANDS AND THEIR SIX RATIOS OF A SINGLE-DATE
LANDSAT IMAGE (August 15, 1973) OF THE DENVER METROPOLITAN AREA. This table is based on identifying
the 24 land uses (table 1) with a test sample of 4,100 picture elements. These elements were systematically
sampled from the center of each nine-by-nine array of picture elements and identified and grouped by the
land-use codes specified by the 1972-1973 USGS land-use data plane. The Landsat-1 image variables were
added in a free stepwise fashion and were classified using linear-discriminant analysis.


Step 

Number 

Landsat 

Variable b'ntered (free) 

- — 1 

Training-Set 

C-lassitlcation 

(vp* 

lime 

l\pended 

(seconds) 

C'orrect Points/C'-P 
Second l^xpended 

P’-V^alue 

to 

Total 

Points 

C'oravt 

C'orrect 

Points 

(‘T) 

Step 

Average 

1 

MSS-7 (solar infrared) 

uoo 

51.40 

\^.\4 

07.40 

07.40 

08.50 


MSS-5/MS.S-4 ratio 

1.417 


20.50 

(W.I2 

08.20 

57.02 


MSS-7/MSS-.S ratio 

U4^5 

5(>.4(> 

21.70 

08.70 

08.44 

58.01 

4 

MSS-4 (visible green) 

K55I 

57.85 

2.1.15 

07.00 

08.04 

25.40 

5 

MSS-5 (visible a-U) 

U570 

58.20 

24.41 

04.52 

07.21 

52.50 

t) 

MSS-7/MSS-4 ratio 

U5<>: 

.18.10 

25.75 

oO.oo 

05.00 

10.54 

7 

MSS-.S/MSS-b ratio 


.18.10 

2o.8o 

58.15 

04.00 

O.10 

8 

MSS-7/MSS-b ratio 

1.570 

.18. 'O 

28.0‘) 

55.80 

o5..^o 

5.54 

g 

MSS-o (solar infrared) 

K575 

.18.41 

2‘)..1*> 

55.50 

02.05 

1.08 

10 

MSS-b/MSS-4 ratio 

1.575 

.18.41 

.10.77 

51.10 

00.71 

2.28 


*C-P = central processor.


The same 4,100 entities of the 1/81 grid-sampled point-training set were used to generate the mean and
covariance matrices for classifying the one-ninth sampled image by maximum-likelihood ratioing with four,
six, and 22 variables. The four-variable combination was the basic four MSS bands. The six-variable set included
MSS-4, MSS-7, MSS-7/MSS-5 ratio, MSS-5/MSS-4 ratio, topographic elevation, and urban built-up minimum
distances. The 22-variable run contained the maximum number of the 41 variables that could be used (that
is, those variables with nonzero variance in all classes and therefore capable of matrix inversion). These were
the four basic MSS bands, six MSS ratios, four MSS/insolation ratios, topographic elevation and aspect,
insolation, and five transportation minimum-distance variables.
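
The screening rule described above (keep only variables whose variance is nonzero in every class) can be sketched as follows. This is a minimal illustration in Python with NumPy, not the program used in the study; the function name `usable_variables` and the toy data are hypothetical. Nonzero per-class variance is a necessary condition for inverting each class covariance matrix, but it does not by itself rule out collinear variables.

```python
import numpy as np

def usable_variables(X, labels):
    """Return indices of columns whose variance is nonzero in every class."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    keep = np.ones(X.shape[1], dtype=bool)
    for c in np.unique(labels):
        # A variable that is constant within a class gives that class a
        # singular covariance matrix, so it is screened out.
        keep &= X[labels == c].var(axis=0) > 0.0
    return np.flatnonzero(keep)

# Toy data (hypothetical): 3 variables, 2 classes.
# Variable 1 is constant within class 0 and is therefore dropped.
X = np.array([[1.0, 5.0, 0.2],
              [2.0, 5.0, 0.4],
              [3.0, 7.0, 0.1],
              [4.0, 9.0, 0.9]])
labels = np.array([0, 0, 1, 1])
selected = usable_variables(X, labels)
print(selected)
```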

These three combinations were applied to the one-ninth sample image using maximum-likelihood ratioing and
were checked for first- and composite second-/third-order accuracy on a point-by-point basis for the 36,864
points against the 1972-1973 USGS land-use data plane. These map-verification accuracies checked and
displayed to the first order were 53.6, 68.1, and 61.5 percent, respectively, for four, six, and 22 variables
(figure 64), whereas the composite second- and third-order accuracies were 4.3, 15.4, and 2.4 percent,
respectively (figure 65).


The 1/81 grid-sampled-point data set was classified by stepwise discriminant analysis for the same
combination of variables to provide comparative results at both the first- and composite second- and third-order
levels for the nonparametric comparison (table 22). Although these linear-discriminant analyses were run
on a smaller data set than that of the maximum-likelihood ratio tests, the close correspondence of the 1/81
data set to the one-ninth sample-image results has already been documented.
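
The two systematic samples can be illustrated with array slicing. The sketch below assumes a square 576-by-576 image; that size is only inferred from the 331,776-point and 36,864-point counts quoted in this report and is not stated as an array dimension. Under that assumption, the 1/81 sample holds 4,096 points, which is presumably the figure rounded to 4,100 in the text. The function name `grid_sample` is hypothetical.

```python
import numpy as np

def grid_sample(image, block):
    """Take the center element of each block-by-block array of picture elements."""
    offset = block // 2
    return image[offset::block, offset::block]

# Hypothetical single-band image; 576 * 576 = 331,776 picture elements.
image = np.arange(576 * 576).reshape(576, 576)

ninth = grid_sample(image, 3)      # center of each three-by-three array
training = grid_sample(image, 9)   # center of each nine-by-nine array

print(ninth.shape, ninth.size)         # one-ninth sample image
print(training.shape, training.size)   # 1/81 grid-sampled training set
```

Because nine is a multiple of three and both samples use block centers, every point of the 1/81 sample is also a point of the one-ninth sample in this sketch.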
The poor performance of the maximum-likelihood ratio, especially for the second- and third-order classes,
was surprising. A detailed explanation for this occurrence is not currently possible. However, the fact that
the first-order accuracies were comparatively high relative to the composite second- and third-order accuracies
may provide a clue. It is believed that the capability for specifying a priori class probabilities for discriminant
analysis materially improved all its classifications. No such capability existed in the maximum-likelihood
algorithm used. Hence, it could achieve comparable results to discriminant analysis for only six first-order
classes; however, the statistical similarity of 24 second- and third-order USGS land-use classes proved too
much for it to handle.

A prerequisite for the use of the maximum-likelihood ratio algorithm is that the data are multivariate
normally distributed (Appendix D). The non-Gaussian distributions of the 4,100 grid-sampled training-set points
could conceivably have accounted for the poor performance of the maximum-likelihood ratio in classifying
the 24 land-use classes of the one-ninth image sample. However, the central limit theorem essentially states
that the distribution of either the sums or averages of n measurements drawn from any population tends to
possess, approximately, a normal distribution in repeated sampling when n is large (Reference 43). The
exact form of the distribution of the grid-sampled points is not now known, but the normal approximation
was justifiable in the four- and six-variable combinations in which the Landsat image bands were numerically
dominant.

This presented the perennial problem of equal versus weighted probabilities for predicting from a training
set to an unknown image scene. The use of equal a priori class probabilities presumes equal likelihood of
each land use in the larger scene, whereas weighted probabilities presume that the image analyst has a
priori knowledge of the approximate amount of each land use within the classification set. Some
knowledge of the amount of each land use in the scene is invariably available or could be achieved by a cursory
inspection of Landsat imagery in the photographic form. The 1/81 point-sampled training set represented
a probability sampling proportional to total land-use area in the larger sampled image. These a priori class
probabilities were used for both training-set and image-classification efforts. The statistical benefits of
inputting this a priori knowledge to the linear-discriminant analysis, versus the assumption of equal
probabilities, were also tested and verified here for the different combinations and at both the first- and composite
second- and third-order levels (table 22).
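
The effect of weighted versus equal a priori probabilities can be sketched with a textbook linear-discriminant rule. The code below is an illustration under stated assumptions, not the discriminant program used in this study: it assumes a pooled within-class covariance matrix, takes the priors as the class proportions of the training sample (as a grid sample proportional to land-use area would give), and uses hypothetical names (`fit_lda`, `classify`) and toy numbers.

```python
import numpy as np

def fit_lda(X, y):
    """Estimate class means, pooled covariance, and sample-proportion priors."""
    classes = np.unique(y)
    means = np.array([X[y == c].mean(axis=0) for c in classes])
    # Pooled within-class covariance shared by all classes (the linear case).
    centered = X - means[np.searchsorted(classes, y)]
    pooled = centered.T @ centered / (len(X) - len(classes))
    priors = np.array([(y == c).mean() for c in classes])
    return classes, means, pooled, priors

def classify(X, classes, means, pooled, priors):
    """Assign each row to the class with the largest linear discriminant score."""
    inv = np.linalg.inv(pooled)
    # score_k(x) = x' S^-1 m_k - 0.5 m_k' S^-1 m_k + ln(prior_k)
    scores = (X @ inv @ means.T
              - 0.5 * np.sum(means @ inv * means, axis=1)
              + np.log(priors))
    return classes[np.argmax(scores, axis=1)]

# Toy case (hypothetical): a point slightly nearer class 0 is reassigned
# to class 1 once class 1 is given a much larger a priori probability.
classes = np.array([0, 1])
means = np.array([[0.0, 0.0], [2.0, 0.0]])
pooled = np.eye(2)
x = np.array([[0.9, 0.0]])
equal = classify(x, classes, means, pooled, np.array([0.5, 0.5]))
weighted = classify(x, classes, means, pooled, np.array([0.2, 0.8]))
print(equal, weighted)
```

With equal priors the log-prior term is the same for every class and drops out of the comparison, which is why the two settings can disagree only for points near a class boundary.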

Finally, a clear economic advantage was also illustrated irrespective of classification accuracies, as the
maximum-likelihood ratio algorithm took 321.2, 583.7, and 1,574.6 seconds, respectively, for the
four-, six-, and 22-variable combinations. A computational time advantage emerged for the linear-discriminant
analysis function, which took 417.6, 441.4, and 624.0 seconds, respectively, for five-, seven-, and 22-variable
combinations (table 21). Clearly, the linear-discriminant function was much less sensitive to the increasing
number of channels. A further cost advantage emerges if cost is weighted by classification accuracies for
a comparative analysis.
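
One way to weight cost by accuracy is the measure already tabulated in this report, correct points per C-P second. The sketch below applies it to both classifiers on the 36,864-point sample image, using the composite second- and third-order accuracies quoted above for the maximum-likelihood ratio and the correct-point counts listed in table 21 for the linear-discriminant runs that took 417.55, 441.35, and 623.99 seconds. The variable sets of the paired runs are not identical, so the comparison is indicative only.

```python
# Sketch only: correct points per C-P second for the two classifiers.
TOTAL_POINTS = 36864  # one-ninth sample image

# Maximum-likelihood ratio: (composite second-/third-order accuracy in percent,
# C-P seconds) for the four-, six-, and 22-variable runs.
ml_runs = [(4.3, 321.2), (15.4, 583.7), (2.4, 1574.6)]

# Linear-discriminant analysis: (correct points, C-P seconds) for the three
# runs cited in the text, with point counts as listed in table 21.
lda_runs = [(13687, 417.55), (13652, 441.35), (18849, 623.99)]

ml_rates = [pct / 100.0 * TOTAL_POINTS / sec for pct, sec in ml_runs]
lda_rates = [points / sec for points, sec in lda_runs]

for label, rates in (("maximum likelihood", ml_rates),
                     ("linear discriminant", lda_rates)):
    print(label, [round(r, 2) for r in rates])
```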

PRODUCTION OF LAND-USE MAPS 

After the feature-extraction step is completed, the fourth step in the pattern-recognition process is that of 
decision and classification. All cells of interest are tested against all features selected for representation in
the feature-extraction step. A probabilistic decision is then made to determine into which class each cell is 
placed. 
FIGURE 59. VERIFICATION OF ACCURACIES AND COSTS OF PROCESSING SINGLE-DATE (August 15, 1973) LANDSAT
IMAGERY OF THE DENVER METROPOLITAN AREA WITH TEN MSS-BAND/RATIO VARIABLES AND 31 ANCILLARY
LANDSCAPE VARIABLES. This figure is based on identifying the 24 land uses (table 1)
with 4,100 grid-sampled picture elements and the 1972-1973 USGS land-use data plane. The Landsat image variables
were forced in a predetermined order (table 19), and the landscape variables were added in a free stepwise fashion.
The variable numbers coincide with the step numbers and MSS-band and data-plane identities in table 20.
TABLE 20

IMPROVEMENT IN THE VERIFICATION ACCURACIES OF AUTOMATED INTERPRETATION BY THE ADDITION
OF 31 ANCILLARY VARIABLES FOR A SINGLE-DATE (August 15, 1973) LANDSAT IMAGE OF THE DENVER
METROPOLITAN AREA. This table is based on identifying 24 land uses (table 1) with a test sample of 4,100 picture
elements. These elements were systematically sampled from the center of each nine-by-nine array of picture elements and
identified and grouped by the land-use codes specified by the 1972-1973 USGS land-use data plane. The Landsat-1 variables were
forced in a predetermined order (table 19), and the landscape variables were added in a free stepwise fashion and were
classified using linear-discriminant analysis.




Ttamtng-Set 
i lassitkatinn 

c-p* 

Correct Poinis/C*P 
Second Expended I'M 


Step 

Number 

Variable ImtereU 

Total 
Points 
( orreci 

C'turect 

Points 

CM 

t ime 

Ixpended 
t seconds) 

Step 

Average 

P- Value 
to hntcr 

1 

I.au4sat (forced) 

MSS'? (^olar infraa’d ) 

1.2 VO 

51.4b 

42.45 

50. .5 V 

50.59 

68. S6 


MSS-5.'MSS-4 ratio 

1.417 

54.5b 

45.84 

52.52 

51.57 

57,02 


MSS-7 MSS-5 ratio 

1.4V5 

.V».4b 

42.25 

55.58 

32.69 

58.61 

4 

MSS*4 (visible grveu) 

1,55! 

57.85 

46.52 

55.54 

518(1 

25.46 

S 

MSS-5 (visible a»d) 

1.570 

.^8.2*) 

47.48 

55.07 

52.91 

5156 

u 

MSS-7-MSS.4 ratio 

1,5(0 

.58.10 

48.48 

52.22 

5178 

10.54 

7 

.^ISS.S.MSS.6 ratio 

1,5(.2 

58.10 

4V.57 

51.b4 

52.61 

9.10 

X 

MSS-7 MSS-b ratio 

1.570 

58.2V 

50.52 

51.08 

52.40 

5.44 

*) 

MSS<b (solar infraad) 

1.575 

5K.4 1 

51.25 

.^0.75 

5140 

1.08 

10 

MSS-h;MSS-4 ratio 

1.575 

5X.4I 

52.(>0 

2V.V4 

51.95 

128 

11 

Landscape (tree) 
Toptigraphic elesution 


45.71 

5.t.XX 

55.2b 

5108 

259.46 

i: 

Average iiumK-r ol cars per famil> 

l.vp 

4b.?(. 

55.24 

54.70 

52.55 

74.54 

1.^ 

Built-up url>an area MU* 

2.0 So 

50.15 

5b.52 

5b. 5X 

52.69 

65.55 

14 

Tttpugraphic slope 

2.040 

4V.'»(» 

5^.67 

55.5 

5191 

56.67 

IS 

Average number ol lamilies per aers* 

2.120 


5X.V5 

55.*8i 

.55.15 

51.59 

|{> 

IVbVmean lamil> income 

MIX 

51. Ob 

(»o.r 
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5.5. .50 
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17 

Median housing-unit value 

' 1 

SI. XX 

bl.44 

.U.b2 

55.59 

.50.21 

IX 
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2.115 

Sl.SV 

o2.*2 
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IV 
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:o 
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2.lf.b 

^ ’ X( 

xtllt 

'X 

51 (M 

4 5'l 

.<4 

Average minibct o| vacant hoiiMne 

’ r ■ 

St |0 

S5 12 

^X 

to x: 

t tx 

55 

uiuis per as re 

total vacant housing units 

2 |s> 

' t 2*» 

XI. t: 

2^ 2x 

to (.0 

t 4' 

5ii 

topogiapliH aspetl 

Mvt 

5t >I 

-S ‘ M 

Is 

to 40 

t 0.' 

5" 

total veat-ioiind littiiMiig units 

' -S!| 

' t I.X 

N ’ »ai 

♦Mil 

to SI 

1 V ‘ 

IX 

1 aiidsat 1 MSSii insolation lath* 

' Sill 

< t M 1 

SS V 1 

M .M 

tool 

2 lo 

IV 

\vvfage mimlvi ol Veat touml liousing 

* Ml 

s t Vi 

VM f. ‘ 

M tv 

'•1 M 

1 H ' 

4(J 

iimts pel acre 

Population dehsilv pel ,ku 

' Mu 

M '1- 

V * 

M V.| 

'V (.1 

1 to 


1 andsat 1 MSS- ‘ msolation t,iiio 

' SIX 

^ t Ss 

•I ' s 

‘ t xn 

"1 4 1 

0 «>(. 


*C-P = central processor.
†MD = minimum distance.



FIGURE 60. MAP-VERIFICATION ACCURACIES AND COSTS OF PROCESSING SINGLE-DATE LANDSAT
IMAGERY OF THE DENVER METROPOLITAN AREA WITH TEN MSS-BAND/RATIO VARIABLES AND 31
ANCILLARY LANDSCAPE VARIABLES. This figure is based on identifying 24 land uses (table 1) for 36,864
grid-sampled picture elements representing the 1972-1973 land-use data plane. The Landsat image variables were
forced in a predetermined order (table 19), and the landscape variables were added in a free stepwise fashion and
were classified using linear-discriminant analysis. The variable numbers coincide with the step numbers and MSS-
band and data-plane identities in table 21.






TABLE 21

[title partly illegible] SAMPLING APPROACH TO ASSEMBLING TRAINING SETS (table 20). This table is based
on identifying the 24 land uses (table 1) with a sample of picture elements systematically selected as the
center of each three-by-three array of picture elements and checked by cell-by-cell comparison with the
1972-1973 USGS land-use data plane. The 41 variables were entered in exactly the same stepwise order as
determined by the grid-sampled training set (table 20).

                                               Training-Set Classification              Correct Points/C-P
                                               Total      Points     C-P* Time          Second Expended
Step                                           Points     Correct    Expended                                    F-Value
Number  Variable Entered                       Correct    (%)        (seconds)          Step       Average       to Enter

        Landsat (forced)
  1     MSS-7 (solar infrared)                 11,246     30.51      368.55             30.51      30.51          68.56
  3     MSS-7/MSS-5 ratio                      12,939     35.10      395.10             32.75      31.67          38.61
  5     MSS-5 (visible red)                    13,687     37.13      417.55             32.78      32.06          52.56
  7     MSS-5/MSS-6 ratio                      13,652     37.03      441.35             30.93      31.75           9.10
  9     MSS-6 (solar infrared)                 13,645     37.01      467.69             29.18      31.18           1.08

        Landscape (free)
 11     Topographic elevation                  16,027     43.48      499.18             32.76      31.48         259.46
 13     Built-up urban area MD†                18,161     49.26      514.49             35.30      (illegible)    65.53
 15     Average number of families per acre    18,664     50.63      530.46             35.18      32.56         (illegible)
 17     Median housing-unit value              18,786     50.96      552.37             34.01      32.75          30.21
 19     Composite minor-road MD                18,920     51.32      576.98             32.79      32.79          16.50
 23     Median housing-unit rent               18,849     51.13      623.99             30.21      32.46          10.49
 27     Total one-car families                 18,986     51.50      672.05             28.25      31.99           6.45
 31     Total census-tract acreage             19,210     52.11      721.17             26.64      31.42           5.52
 36     Topographic aspect                     19,405     52.64      772.18             25.13      30.78           3.02
 41     Landsat-1 MSS-7/insolation ratio       19,482     52.85      825.90             23.59      30.07           0.96

*C-P = central processor.
†MD = minimum distance.

Selection of Optimal Mapping Variables 

r, onK^ArSleT^r “ 

combination of the large number of variables and the sL of the Imar* H necessary by the 

ol image variables would be included in this subset so that thA T* that a number 

unduly mnuenced by ,h« uncilho- Undscap. ,„tobl« mapp^ 2 to ** 

MSS4) accounted for practically all of the accuracv in «,« ratio, MSS-7/MSS-5 ratio, and 




[Figure 61 panels: Variable 1 Added; Variable 5 Added; Variable 9 Added; Variable 13 Added; Variable 17 Added;
Variable 23 Added; 1972-1973 USGS Land-Use Reference Map.]

FIGURE 61. STEP-BY-STEP CLASSIFICATION OF THE LAND USE OF THE DENVER METROPOLITAN AREA
VERIFIED AT THE FIRST ORDER (August 15, 1973, Landsat-1 image).

[Panels of the following figure, whose caption is not legible: Variable 1 Added; Variable 5 Added;
Variable 9 Added; Variable 13 Added; land use verified at the second and third orders.]
(a) Original 24 Second- and Third-Order USGS Classes
(b) Composite Six First-Order USGS Classes

FIGURE 63. VERIFIED MAP ACCURACY FOR THE DENVER METROPOLITAN AREA FOR ALL USGS
LAND-USE CLASSES AT THE 41st STEP (August 15, 1973, Landsat-1 image). The linear-discriminant
classification tested here employed the 41 image/map variables exclusive of land use. A point was plotted in black
if the individual cell was correctly classified when checked against the corresponding cell on the 1972-1973
USGS land-use data plane. Image (b) represents the same 24 classes but was checked for correctness at the
six first-order classes only.

training-set test (figure 59 and table 20). Five test runs with no forced variables were made on the 1/81
training set with the four image bands and the six, seven, eight, nine, and ten landscape variables,
progressively adding another landscape variable for each successive test. The ten-channel combination using six
landscape variables proved to be the most accurate, correctly classifying 2,091 points for a 51-percent
verified accuracy (figure 66 and table 23).

The use of the ten channels achieved 94 percent of the maximum accuracy that could have been obtained
with all 41 variables. However, it represented an expenditure of only 34 percent of the processing time
that would have been needed.

Each of the following first-order classification displays that resulted is composed of three thematic maps:
(1) the 1972-1973 USGS land-use data plane for reference; (2) the verified discriminant-classified map
showing only correctly mapped 0.4-ha (1.111-acre) picture elements; and (3) the actual discriminant-
classified map. Unfortunately, the lack of an appropriate color-display device necessitated the cumbersome
display of each separate first-order theme map in black-and-white. Using this general approach, the
six aggregate first-order categories were displayed for the zoomed USGS reference data plane, the verified
classification map, and the machine-processed land-use map (figure 67).
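
The relationship among the three thematic maps can be sketched in a few lines. The example below is a minimal illustration with hypothetical class codes and a hypothetical function name (`verify`): the verified map keeps a cell only where the classified map agrees with the reference data plane, and per-class accuracy is the share of each reference class that was correctly mapped.

```python
import numpy as np

def verify(classified, reference):
    """Return the mask of correct cells and per-class accuracy in percent."""
    correct = classified == reference
    accuracy = {}
    for c in np.unique(reference):
        in_class = reference == c
        # Share of reference cells of class c that were mapped as class c.
        accuracy[int(c)] = 100.0 * correct[in_class].mean()
    return correct, accuracy

# Toy 2 x 3 maps with hypothetical class codes 1, 2, and 3.
reference = np.array([[1, 1, 2],
                      [2, 2, 3]])
classified = np.array([[1, 2, 2],
                       [2, 1, 3]])

correct, accuracy = verify(classified, reference)
# Verified map: plot a cell only where it checks with the reference map
# (0 stands for "not plotted" in this sketch).
verified_map = np.where(correct, classified, 0)
print(accuracy)
print(verified_map)
```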
TABLE 22

MAXIMUM-LIKELIHOOD AND LINEAR-DISCRIMINANT IMAGE-CLASSIFICATION ALGORITHMS. The accuracy of each of
these two automated image classifications was checked point-by-point at a first and [illegible] of all map
variables. The maximum-likelihood classification was performed on the sampled image of 36,864 points, and
for economy, the linear-discriminant tests were run on the 4,100 systematically sampled points only.

               First-Order Verification (percent)             Second-Order Verification (percent)
Number         Maximum-     Linear-Discriminant Analysis      Maximum-     Linear-Discriminant Analysis
of             Likelihood   Equal           A Priori          Likelihood   Equal           A Priori
Variables      Ratio        Probabilities   Probabilities     Ratio        Probabilities   Probabilities

Four*          53.6         50.9            65.2               4.3         19.1            37.9
Six†           68.1         70.6            73.1              15.4         32.3            46.3
Twenty-two‡    61.5         73.1            75.6               2.4         37.2            48.3

*Four original Landsat-1 MSS bands.
†Six channels: MSS-4, MSS-7, MSS-7/MSS-5 ratio, MSS-5/MSS-4 ratio, topographic elevation, and
built-up urban-area minimum distance.
‡Twenty-two channels: four original Landsat-1 MSS bands, six Landsat-1 band ratios, four Landsat-1
MSS/insolation ratios, Landsat-1 image insolation, topographic aspect, topographic elevation, composite
minor-road MD, composite major-road MD, freeway MD, freeway interchange MD, and built-up urban-
area MD (MD = minimum distance).


Second- and Third-Order Theme Maps

The eleven second- and third-order urban land-use classes were displayed for the USGS land-use reference
data plane, the point-verified discriminant-classified land-use map, and the actual discriminant-classification
map (figure 68).

The three third-order agricultural land-use classes were displayed for the USGS land-use reference data plane,
the point-verified discriminant-classified land-use map, and the actual discriminant-classification map (figure
69).

The three second-order water land-use classes were displayed for the USGS land-use reference data plane,
the point-verified discriminant-classified land-use map, and the actual discriminant-classification land-use
map (figure 70).

The two second-order range land-use classes were displayed for the USGS land-use reference data plane, the
point-verified discriminant-classified land-use map, and the actual discriminant-classification land-use map
(figure 71).


[Figure 66 plot: correct points (percent) and C-P time expended (seconds) plotted against classification
variables.]

FIGURE 66. TEST OF FINAL TEN-CHANNEL COMBINATION OF LANDSAT AND LANDSCAPE VARIABLES
FOR THE FULL IMAGE CLASSIFICATION. This figure is based on identifying 24 land uses (table 1) with 4,100
grid-sampled picture elements. This ten-variable combination was selected as an optimal subset of the 41 available
image/ancillary variables. It realized over 94 percent of the maximum accuracy that could have been obtained with
all 41 variables, but represented an expenditure of only 34 percent of the processing time. All variables were added in
a free stepwise fashion and were classified using linear-discriminant analysis.
TABLE 23

TEST OF FINAL TEN-CHANNEL COMBINATION OF LANDSAT AND LANDSCAPE VARIABLES FOR FULL IMAGE
CLASSIFICATION (August 15, 1973 image). This table is based on identifying 24 land uses (table 1) with 4,100
grid-sampled picture elements. [Remainder of caption illegible.]

                                                  Training-Set Classification            Correct Points/C-P
                                                  Total      Points     C-P Time          Second Expended
Step    Landsat and Landscape                     Points     Correct    Expended                                 F-Value
Number  Variables Entered                         Correct    (%)        (seconds)         Step       Average     to Enter

  1     Topographic elevation                     1,428      34.83      19.51             73.19      73.19       315.73
  2     Built-up urban-area MD*                   1,701      41.49      20.92             81.31      77.39        88.79
  3     Average number of families per acre       1,892      46.15      22.26             85.00      80.09        75.66
  4     Landsat-1 MSS-7 (solar infrared)          1,921      46.85      23.76             80.85      80.30        61.53
  5     Topographic slope                         1,911      46.61      25.66             74.47      78.97        57.42
  6     Average number of year-round housing      2,038      49.71      25.95             78.54      78.89        40.50
        units per acre
  7     Landsat-1 MSS-5/MSS-4 ratio               2,035      49.63      27.16             74.93      78.24        29.75
  8     Median housing-unit value                 2,059      50.22      28.26             72.86      77.45        26.33
  9     Landsat-1 MSS-7/MSS-5 ratio               2,063      50.32      29.55             69.81      76.44        20.44
 10     Landsat-1 MSS-4 (visible green)           2,091      51.00      31.66             66.05      75.15        21.77

*MD = minimum distance.
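
The "Correct Points/C-P Second Expended" step values in table 23 follow directly from the other columns. The sketch below recomputes the percent-correct and step-rate columns from the correct-point counts and C-P times as transcribed from the table; it is offered only as an arithmetic check, and the variable names are hypothetical.

```python
# (step, correct points, C-P seconds) as transcribed from table 23
rows = [
    (1, 1428, 19.51), (2, 1701, 20.92), (3, 1892, 22.26), (4, 1921, 23.76),
    (5, 1911, 25.66), (6, 2038, 25.95), (7, 2035, 27.16), (8, 2059, 28.26),
    (9, 2063, 29.55), (10, 2091, 31.66),
]
TOTAL_POINTS = 4100  # grid-sampled training set

# Percent correct and correct points per C-P second for each step.
percent = {step: 100.0 * pts / TOTAL_POINTS for step, pts, _ in rows}
rate = {step: pts / sec for step, pts, sec in rows}

# Step at which one more variable still paid for itself most efficiently.
best_step = max(rate, key=rate.get)

for step, pts, sec in rows:
    print(step, round(percent[step], 2), round(rate[step], 2))
print("most cost-effective step:", best_step)
```

In this reading, accuracy keeps rising through step 10 while the points-per-second rate peaks at step 3, which is the trade-off the ten-channel selection is balancing.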


The second- and third-order barren land-use classes were displayed for the USGS land-use reference data plane,
the point-verified discriminant-classified land-use map, and the actual discriminant-classification land-use map
(figure 72).

The three third-order forest land-use classes were displayed for the USGS land-use reference data plane, the
point-verified discriminant-classified land-use map, and the actual discriminant-classification land-use map
(figure 73).


Verification 

The verification of classification results in this study was a highly structured procedure. An advantage of the
quantitative orientation taken throughout this endeavor was the explicit quantitative verification of results.
The [illegible] of the computer lay not only in the capability of massive data manipulation, but also in the
[remainder of sentence illegible]
Display scale 1:250,000

(a) Six first-order land-use classes identified by USGS airphoto interpretation.

BLACK = All water areas
DARKEST GRAY = All range areas
DARK GRAY = All agricultural areas
GRAY = All forest areas
LIGHT GRAY = All barren areas
LIGHTEST GRAY = All urban areas

Display scale 1:250,000

(b) Verification map of the six first-order classifications (c) where points are plotted
only if they check with USGS map (a).

BLACK = All water areas (62.0% correct)
DARKEST GRAY = All range areas (56.3% correct)
DARK GRAY = All agricultural areas (56.7% correct)
GRAY = All forest areas (66.3% correct)
LIGHT GRAY = All barren areas (35.0% correct)
LIGHTEST GRAY = All urban areas (89.3% correct)

Display scale 1:250,000

(c) Six first-order land-use classes identified using discriminant analysis with
four image and six ancillary variables.

BLACK = All water areas
DARKEST GRAY = All range areas
DARK GRAY = All agricultural areas
GRAY = All forest areas
LIGHT GRAY = All barren areas
LIGHTEST GRAY = All urban areas

FIGURE 67. COMPARATIVE DISPLAYS OF THE DISCRIMINANT CLASSIFICATION, A VERIFICATION OF
THAT CLASSIFICATION, AND THE USGS MAP OF THE SIX FIRST-ORDER LAND USES OF THE DENVER AREA.
Scale 1:250,000.
Display scale 1:250,000

(a) Eleven second- and third-order urban land-use classes identified by USGS airphoto interpretation.

BLACK = Industrial/transportation
DARK GRAY = Cemetery/recreational/open land
MEDIUM GRAY = Utility/public and institutional
GRAY = Solid-waste dump/extraction
LIGHT GRAY = Residential/commercial and services

Display scale 1:250,000

(b) Verification map of the eleven second- and third-order urban land-use classifications (c)
where points are plotted only if they check with USGS map (a).

BLACK = Industrial/transportation (42.1% correct)
DARK GRAY = Cemetery/recreational/open land (0.1% correct)
MEDIUM GRAY = Utility/public and institutional (49.2% correct)
GRAY = Solid-waste dump/extraction (0.0% correct)
LIGHT GRAY = Residential/commercial and services (78.0% correct)

(c) Eleven second- and third-order urban land-use classes identified using discriminant
analysis with four image and six ancillary variables.

BLACK = Industrial/transportation
DARK GRAY = Cemetery/recreational/open land
MEDIUM GRAY = Utility/public and institutional
GRAY = Solid-waste dump/extraction
LIGHT GRAY = Residential/commercial and services

FIGURE 68. COMPARATIVE DISPLAYS OF THE DISCRIMINANT CLASSIFICATION, A VERIFICATION OF THAT
CLASSIFICATION, AND THE USGS MAP OF THE URBAN LAND USES OF THE DENVER AREA. Scale 1:250,000.
Display scale 1:250,000

(b) Verification map of the three second-order agricultural land-use classifications (c)
where points are plotted only if they check with USGS map (a).

BLACK = Irrigated cropland (65.4% correct)
DARK GRAY = Pasture (28.5% correct)
LIGHT GRAY = Nonirrigated cropland (38.8% correct)

Display scale 1:250,000

(c) Three second-order agricultural land-use classes identified using discriminant
analysis with four image and six ancillary variables.

BLACK = Irrigated cropland
DARK GRAY = Pasture
LIGHT GRAY = Nonirrigated cropland

FIGURE 69. COMPARATIVE DISPLAYS OF THE DISCRIMINANT CLASSIFICATION, A VERIFICATION OF THAT
CLASSIFICATION, AND THE USGS MAP OF THE AGRICULTURAL LAND USES OF THE DENVER AREA.
Scale 1:250,000.
Display scale 1:250,000

(b) Verification map of the three second-order water-type classifications (c)
where points are plotted only if they check with USGS map (a).

BLACK = Streams and waterways (0.35% correct)
DARK GRAY = Lakes (56.0% correct)
LIGHT GRAY = Reservoirs (58.1% correct)

Display scale 1:250,000

(c) Three second-order water-type classes identified using discriminant
analysis with four image and six ancillary variables.

BLACK = Streams and waterways
DARK GRAY = Lakes
LIGHT GRAY = Reservoirs

FIGURE 70. COMPARATIVE DISPLAYS OF THE DISCRIMINANT CLASSIFICATION, A VERIFICATION OF THAT
CLASSIFICATION, AND THE USGS MAP OF THE WATER-TYPE CLASSES OF THE DENVER AREA. Scale 1:250,000.
Display scale 1:250,000

(a) Two second-order rangeland classes identified by USGS airphoto interpretation.

BLACK = Chapparal (taken as brushland)
DARK GRAY = Grassland

Display scale 1:250,000

(b) Verification map of the two second-order rangeland classifications (c)
where points are plotted only if they check with USGS map (a).

BLACK = Chapparal (taken as brushland) (40.9% correct)
DARK GRAY = Grassland (54.9% correct)

Display scale 1:250,000

(c) Two second-order rangeland classes identified using discriminant
analysis with four image and six ancillary variables.

BLACK = Chapparal (taken as brushland)
DARK GRAY = Grassland

FIGURE 71. COMPARATIVE DISPLAYS OF THE DISCRIMINANT CLASSIFICATION, A VERIFICATION OF THAT
CLASSIFICATION, AND THE USGS MAP OF THE RANGELANDS OF THE DENVER AREA. Scale 1:250,000.



Display scale 1:250,000

(a) Two second-order barrenland classes identified by USGS airphoto interpretation.

BLACK = Exposed rock
DARK GRAY = Hillslopes

(b) Verification map of the two second-order barrenland classifications (c)
where points are plotted only if they check with USGS map (a).

BLACK = Exposed rock (8.7% correct)
DARK GRAY = Hillslopes (38.5% correct)

(c) Two second-order barrenland classes identified using discriminant
analysis with four image and six ancillary variables.

BLACK = Exposed rock
DARK GRAY = Hillslopes

FIGURE 72. COMPARATIVE DISPLAYS OF THE DISCRIMINANT CLASSIFICATION, A VERIFICATION OF THAT
CLASSIFICATION, AND THE USGS MAP OF THE BARRENLANDS OF THE DENVER AREA. Scale 1:250,000.
Display scale 1:250,000

(a) Three second-order forestland classes identified by USGS airphoto interpretation.

BLACK = Deciduous/intermediate crown
DARK GRAY = Coniferous/intermediate crown
LIGHT GRAY = Coniferous/solid crown

Display scale 1:250,000

(b) Verification map of the three second-order forestland classifications (c)
where points are plotted only if they check with USGS map (a).

BLACK = Deciduous/intermediate crown (0.0% correct)
DARK GRAY = Coniferous/intermediate crown (2.8% correct)
LIGHT GRAY = Coniferous/solid crown (81.3% correct)

Display scale 1:250,000

(c) Three second-order forestland classes identified using discriminant
analysis with four image and six ancillary variables.

BLACK = Deciduous/intermediate crown
DARK GRAY = Coniferous/intermediate crown
LIGHT GRAY = Coniferous/solid crown

FIGURE 73. COMPARATIVE DISPLAYS OF THE DISCRIMINANT CLASSIFICATION, A VERIFICATION OF THAT
CLASSIFICATION, AND THE USGS MAP OF THE FORESTLANDS OF THE DENVER AREA. Scale 1:250,000.
The discriminant-function-classified land-use map was checked cell-by-cell against the expanded
1972-1973 USGS reference map (Appendix C). The average verification accuracy for all second- and
third-order land uses was 50.1 percent, whereas the average verification accuracy for all first-order land
uses was [illegible] percent (table 24). The 50.1-percent final verification accuracy was in close agreement with
the 51.0-percent verification accuracy achieved with the 1/81-grid sample of 4,100 points during the
development of the discriminant function (table 23).


The first-order classification results were, in order of decreasing accuracy:

• Urban land use (89.3 percent)
• Forest land use (66.3 percent)
• Water land use (62.0 percent)
• Agricultural land use (56.7 percent)
• Range land use (56.3 percent)
• Barren land use (35.0 percent)

The second-order classification results were, in order of decreasing accuracy:

• Residential (84.9 percent)
• Reservoirs (58.1 percent)
• Lakes (56.0 percent)
• Grassland (54.9 percent)
• Institutional (51.1 percent)
• Transportation (45.3 percent)
• Chapparal (taken as brushland) (40.9 percent)
• Industrial (40.5 percent)
• Commercial and services (25.3 percent)
• Bare exposed rock (8.7 percent)
• Open and other urban (7.4 percent)
• Streams and waterways (0.4 percent)
• Extractive (0.1 percent)
TABLE 24

FINAL TEN-CHANNEL LINEAR-DISCRIMINANT CLASSIFICATION RESULTS USING FOUR
LANDSAT AND SIX ANCILLARY LANDSCAPE VARIABLES. The verification accuracies of the
machine-processed single-date Landsat-1 image (figures 67 through 73) are given. The tabulations
were based on checking 24 land uses on a point-to-point comparison with the 1972-1973 USGS land-
use data plane. The composite second- and third-order average classification accuracy of 50.1 percent
substantially agreed with the 51.0-percent value predicted by the 4,100-point grid-sampling training-
set sample (table 23).


Land Use/Land Cover               Order     Total USGS    Total Cells    Accuracy 
                                               Cells        Correct      (percent) 

Urban and Built-Up Land 
  Residential                     Second   [illegible]    [illegible]      84.88 
  Commercial and services         Second        12,825          3,238      25.25 
  Recreational                    Third         13,125          2,075      15.81 
  Industrial                      Second        12,330          4,997      40.53 
  Extractive                      Second         4,824              6       0.12 
  Transportation                  Second   [illegible]    [illegible]      45.34 
  Utilities                       Third    [illegible]              0       0.00 
  Institutional                   Second   [illegible]    [illegible]      51.1 
  Open and other urban            Second   [illegible]          1,642       7.38 
  Solid-waste dump                Third            254              0       0.00 
  Cemetery                        Third          1,971              0       0.00 
  Subtotal/Average                First        202,563        180,781      89.25 
  Subtotal/Average                Second       186,129        111,297      59.80 
  Subtotal/Average                Third         16,434          2,075      12.63 

Agricultural Land 
  Nonirrigated cropland           Third         46,926         18,226      38.84 
  Irrigated cropland              Third            729            477      65.43 
  Pasture                         Third         34,542          9,850      28.52 
  Subtotal/Average                First         82,197         46,619      56.72 
  Subtotal/Average                Third         82,197         28,553      34.74 

Rangeland 
  Grassland                       Second        25,281         13,876      54.89 
  Chaparral                       Second         2,763          1,129      40.86 
  Subtotal/Average                First         28,044         15,788      56.30 
  Subtotal/Average                Second        28,044         15,005      53.51 

Forest Land 
  Deciduous/intermittent crown    Third            810              0       0.00 
  Coniferous/solid crown          Third          4,068          3,309      81.34 
  Coniferous/intermittent crown   Third            144              4       2.78 
  Subtotal/Average                First          5,022          3,328      66.27 
  Subtotal/Average                Third          5,022          3,313      65.97 

Water 
  Streams and waterways           Second           288              1       0.35 
  Lakes                           Second         4,950          2,773      56.02 
  Reservoirs                      Second         1,521            883      58.05 
  Subtotal/Average                First          6,759          4,190      61.99 
  Subtotal/Average                Second         6,759          3,657      54.11 

Barren Land 
  Bare exposed rock               Second         1,710            149       8.71 
  Hillslopes                      Third          5,481          2,112      38.53 
  Subtotal/Average                First          7,191          2,513      34.95 
  Subtotal/Average                Second         1,710            149       8.71 
  Subtotal/Average                Third          5,481          2,112      38.53 

Aggregate Grand Total/Average     First        331,776        253,219      76.32 
Aggregate Grand Total/Average     Second       222,642        130,108      58.44 
Aggregate Grand Total/Average     Third        109,134         36,053      33.04 
The third-order classification results were, in order of decreasing accuracy: 

• Coniferous/solid crown (81.3 percent) 

• Irrigated cropland (65.4 percent) 

• Nonirrigated cropland (38.8 percent) 

• Hillslopes (38.5 percent) 

• Pasture (28.5 percent) 

• Recreational (15.8 percent) 

• Coniferous/intermittent crown (2.8 percent) 

• Cemetery (0.0 percent) 

• Deciduous/intermittent crown (0.0 percent) 

• Solid-waste dump (0.0 percent) 

• Utilities (0.0 percent) 

Several problem areas influenced the potential classification accuracy. One substantial difficulty was finding 
an image or ancillary landscape variable to adequately represent some of the USGS land uses as land-cover 


Another major problem involved the dual use of the 1972-1973 USGS land-use reference data plane for both 
training and classification verification. It should be recognized that this map was not 100-percent 
correct. The stated 90-percent minimum accuracy of the map may be true for the first-order classes, but 
it was not verified for any level and therefore is doubtful. A 90-percent accurate USGS reference map 
checked point-by-point against an equally accurate classification map would yield an 81-percent accurate 
verification (0.90 x 0.90 = 0.81). This 81-percent verification value is close to the actual first-order 
verified classification accuracy of 76.3 percent. A random-point field check of the final classification 
maps is the only sure verification technique; in using the USGS map solely as a training data source, a 
10- or 20-percent error may be statistically forgiven by the statistical nature of the classification 
procedure, but it cannot be overcome in a final point-to-point comparison. 
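The 81-percent figure can be reproduced with a short sketch. It assumes that errors in the reference map and in the classification map are statistically independent and that two wrong labels never happen to agree; neither assumption is verified in the text, and the function name is illustrative only.

```python
def expected_agreement(reference_accuracy, classification_accuracy):
    """Probability that both maps are correct at a point, assuming independent errors."""
    return reference_accuracy * classification_accuracy


# Two maps that are each 90-percent accurate.
ceiling = expected_agreement(0.90, 0.90)

# First-order result quoted in the text: 253,219 of 331,776 points verified.
observed_first_order = 253_219 / 331_776

# Gap between the expected agreement and the observed value, in percentage points.
shortfall_points = 100.0 * (ceiling - observed_first_order)
```

Under these assumptions the observed first-order accuracy falls a little under five percentage points short of the expected agreement.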


The foremost question about the 1972-1973 USGS map accuracy arose from a companion USGS study in 
which a 1:100,000-scale land-use map was similarly prepared from 1:120,000-scale high-altitude 
color-infrared imagery. Land use was subsequently verified by observation from a low-flying aircraft, 
and the map accuracy was found to be 77.4 percent for 18 second-order classes of the USGS Circular 671 
system (Reference 44). 

Total classification time for the ten-variable discriminant function was 2,517.97 central-processor (C-P) 
seconds to map the 331,776 picture elements into 24 USGS land-use classes, which averaged ~132 points 
processed per C-P second. An accuracy-weighted classification rate would be ~52 correct second-order 
points per C-P second for the 130,108 second-order points correctly classified. Similarly, a rate of ~101 
correct first-order points per C-P second was realized for the 253,219 points correctly classified at the 
first order. 
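The throughput figures follow directly from the quoted counts and processor time. The sketch below only restates that arithmetic; the constant and function names are illustrative.

```python
CP_SECONDS = 2517.97            # central-processor seconds for the full classification
TOTAL_POINTS = 331_776          # picture elements classified
CORRECT_SECOND_ORDER = 130_108  # points correct at the second order
CORRECT_FIRST_ORDER = 253_219   # points correct at the first order


def points_per_cp_second(points, seconds=CP_SECONDS):
    """Classification rate in points per central-processor second."""
    return points / seconds


raw_rate = points_per_cp_second(TOTAL_POINTS)
second_order_rate = points_per_cp_second(CORRECT_SECOND_ORDER)
first_order_rate = points_per_cp_second(CORRECT_FIRST_ORDER)
```

Rounded to whole points, these rates reproduce the approximate values of 132, 52, and 101 points per C-P second.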

Direct Cost/Time Analysis 


A detailed analysis was performed to assess the various direct computer, labor, and material costs and times 
(table 25). These parameters were broken down by submodel and task area as accurately as possible. Direct 
computing costs represented $5,509.05 of the total $6,981.55, or almost 79 percent of the entire direct- 
analysis costs. Only $1,472.50 (21 percent) was expended for direct labor and materials. More important, 
the equivalent average cost per unit area for 48-variable multivariate image classification, verification, and 
display was calculated as either 1.9 cents per acre, 4.7 cents per hectare, $12.12 per square statute mile, or 
$704.19 per 1:24,000-scale USGS quadrangle. 

It was not considered appropriate to allocate direct or indirect labor costs other than those reflected because 
of the extensive iterative nature of any research effort. The time consumed in searching for, examining, and 
discarding potential imagery or maps, for example, was considerable, as was the computer program coding, 
debugging, and development. A production land-use mapping mode using these estimates would assume 
that all input data were on hand and that the computer software was fully implemented. 

These costs also potentially represent the development, assembly, and testing of a very large data base on one 
or more limited map areas in order to optimize the selection of specific Landsat/landscape data planes. The 
extension of this classification to a larger area involves the smaller costs represented by analysis of the ten 
variables finally classified here. Total costs for the classification used can be assembled for the same area 
as a subset of development costs (table 25). These parameters are also broken down by submodel and task 
area and represent direct computer, labor, and material costs and times (table 26). The equivalent average 
cost per unit area for ten-variable multivariate image classification, verification, and display was calculated 
as either 1.3 cents per acre, 3.2 cents per hectare, $8.21 per square statute mile, or $477.17 per 1:24,000- 
scale USGS quadrangle. 
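The per-acre and per-hectare figures can be cross-checked against the per-square-mile figures with standard unit conversions (640 acres per square statute mile, about 2.471 acres per hectare). The sketch below is only that consistency check; the function and constant names are illustrative and do not come from the study's software.

```python
ACRES_PER_SQ_MILE = 640.0
ACRES_PER_HECTARE = 2.4710538


def unit_costs(dollars_per_sq_mile):
    """Convert a cost per square statute mile to (cents per acre, cents per hectare)."""
    cents_per_acre = 100.0 * dollars_per_sq_mile / ACRES_PER_SQ_MILE
    cents_per_hectare = cents_per_acre * ACRES_PER_HECTARE
    return cents_per_acre, cents_per_hectare


development = unit_costs(12.12)  # 48-variable development cost
production = unit_costs(8.21)    # ten-variable production cost

# Share of the total direct cost that was computing cost.
computer_share = 100.0 * 5509.05 / 6981.55
```

Rounded to one decimal place, the converted values agree with the quoted 1.9 and 4.7 cents (development) and 1.3 and 3.2 cents (production), and the computing share comes to just under 79 percent.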

SUMMARY 

The focus of this phase of the research investigation was the automated identification and mapping of the 
various land-use/land-cover types that make up the Denver, Colorado landscape. This endeavor used digital 
Landsat imagery and ancillary landscape variables to test different feature-extraction procedures and machine- 
classification algorithms on a 48-variable data base. The grid-sampled-point feature-selection process made use 
of an existing ground-truth reference map and proved to be superior to the conventional rectangular 
supervised field selection procedure. 

A nonparametric classification technique, linear-discriminant analysis with a priori class probabilities, clearly 
proved to be superior to the maximum-likelihood ratio method in common use today. These two algorithms 
were tested for identical combinations of four, six, and 22 variables. The 4,100-point 1/81 grid-sampling 
image was used as the statistical training set for both approaches. 


The incremental contribution of ancillary spatial landscape variables to the classification of Landsat 
spectral data was quantified with a stepwise linear-discriminant function. A ten-channel combination of Landsat 
TABLE 25 

COST/TIME TABULATION FOR SINGLE-DATE, 48-VARIABLE LANDSAT/ANCILLARY LANDSCAPE 
VARIABLE CLASSIFICATION OF THE DENVER METROPOLITAN AREA. These figures are based on the 
Control Data Corporation 6400 computer and system used at Colorado State University at the basic campus 
research rate of $290 per machine hour and an hourly work rate of $5 per man-hour. Quoted figures represent 
only direct computer, labor, and material costs/times. They account for one-time costs only and do not 
represent redoing anything. Many of the computer program names refer to programs or modifications of 
programs in the LMS package; others refer to pro- 
TABLE 26 

COST/TIME TABULATION FOR SINGLE-DATE, TEN-VARIABLE LANDSAT/ANCILLARY LANDSCAPE 
VARIABLE CLASSIFICATION OF THE DENVER METROPOLITAN AREA. These figures are based on the 
Control Data Corporation 6400 computer and system used at Colorado State University at the basic campus 
research rate of $290 per machine hour and an hourly work rate of $5 per man-hour. Quoted figures represent 
only direct computer, labor, and material costs/times. They account for one-time costs only and do not 
represent redoing anything. Only the steps that would actually be employed in extending the concepts learned 
with the 48-variable tests were costed. For ease of comparison, the area analyzed here was assumed to be the 
same size. 
basic bands and ratios produced an average verification accuracy of 38.4 percent. The addition of three 
ancillary variables (topographic elevation, average number of cars per family, and built-up urban-area 
minimum distance) increased this verification accuracy to over 50 percent, but the remaining non-land-use 
spatial variables raised the final average accuracy to only 53.9 percent. 

An optimal ten-channel combination of four Landsat image bands and six ancillary landscape variables was 
used to produce the final land-use classification map. Point-by-point comparison with the 1972-1973 USGS 
land-use reference data plane showed an overall accuracy of 76.3 percent for the six aggregate first-order 
classes. Average accuracy for the 24 second- and third-order classes was 58.4 and 33.0 percent, 
respectively. Four third-order categories received no correct classifications. 

A cost tabulation for the multivariate data compilation, registration, classification, verification, and 
display showed the average cost of developing the model to be 1.9 cents per acre, 4.7 cents per hectare, 
$12.12 per square statute mile, or $704.19 per 7.5-minute USGS quadrangle. Actual production application 
of the selected optimal channels and signatures yielded costs of 1.3 cents per acre, 3.2 cents per hectare, 
$8.21 per square statute mile, or $477.17 per 7.5-minute USGS quadrangle. 
CHAPTER 5 

CONCLUSIONS 


SUMMARY 

Significant findings were made in the related study of spatial land-use projection and Landsat image 
classification. The first line of investigation was an attempt to estimate land-use changes using 
multivariate map variables and known points of change from aerial photointerpretation. Statistical 
relationships were developed relating change and nonchange land-use types to 27 physiographic, 
socioeconomic, and transportation variables. 


The general land-use change model was formulated as a linear-discriminant function using 4-ha (10-acre) 
square cells as prediction units. Map-derived independent variables were numerically related in the 
multivariate structure of this model to known imagery-derived cells of change. 

Seven models were tested, focusing upon cropland and pasture agricultural land, the primary 1963 
land-use change class. Change prediction accuracies ranged from 42 to 57 percent. Although these results 
were encouraging, they were difficult to assess because of the scarcity of similar studies. Although the 
current analysis was tentative and exploratory, it nonetheless indicated the feasibility of developing 
spatial land-use projection models. Clearly, their application could be of considerable utility in future 
improved land-use planning activities. 

The parallel application of linear-discriminant analysis for Landsat multivariate image classification has 
confirmed the value of this flexible statistical technique. Various feature-extraction methods and supervised 
machine-classification algorithms were tested on a single-date Landsat image augmented by 38 additional 
channels of ancillary map data spatially registered to the spectral-image data base. The overall objective 
was to systematically test these factors in producing a land-classification map, using a current USGS 
land-use map as a reference. 

The comparison of two sampling methods for statistical feature extraction indicated a wide variation in 
trial classification test results and applicability to image processing (Appendix E). The conventional 
rectangular training field selection process resulted in a very high test-classification accuracy, but was 
invalidated after poor classification results on a larger one-ninth image sample. However, the systematic 
point-grid sampling gave highly repeatable results. This 1/81-image sample provided not only sample 
points distributed over the entire image, but also useful a priori land-use probabilities. 
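A systematic 1/81 point-grid sample and the a priori probabilities derived from it can be sketched as follows. The sketch assumes the sample takes every ninth row and every ninth column of the 576- by 576-point image, which gives 64 x 64 = 4,096 points, consistent with the roughly 4,100 points quoted; the exact sampling origin used in the study is not stated, and the two-class reference map below is hypothetical.

```python
from collections import Counter


def grid_sample(image, step=9, offset=0):
    """Take every `step`-th cell in both directions from a 2-D list of class labels."""
    return [
        row[col]
        for row in image[offset::step]
        for col in range(offset, len(row), step)
    ]


def prior_probabilities(labels):
    """Relative frequency of each class in the sample, used as a priori probabilities."""
    counts = Counter(labels)
    n = len(labels)
    return {cls: k / n for cls, k in counts.items()}


# Hypothetical 576 x 576 reference map: left half "urban", right half "range".
SIZE = 576
reference = [
    ["urban" if col < SIZE // 2 else "range" for col in range(SIZE)]
    for _ in range(SIZE)
]
sample = grid_sample(reference)
priors = prior_probabilities(sample)
```

Because the sample points are spread evenly over the whole image, the class frequencies in the sample estimate the class proportions of the full map, which is what makes them usable as a priori probabilities.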

These a priori class probabilities enabled the linear-discriminant function to far outclassify the widely 
used maximum-likelihood ratio algorithm. Both procedures were compared on three test sets of image and 
ancillary landscape variables using the same point-training set developed earlier. 

Verification of the 576- by 576-point Landsat image classification with the USGS reference map showed 
an average accuracy of over 76 percent at the generalized first-order categories. The ten-variable 
discriminant function used also classified the 24 second- and third-order classes at accuracies of 58.4 and 
33.0 percent, respectively. 
A significant achievement of this study was the creation of a composite Landsat image file consisting of 
ten spectral bands and ratios overlaid by 38 map-derived ancillary land-use, physiographic, socioeconomic, 
transportation, and MSS/insolation ratio variables. Statistical testing showed these collateral variables 
(excluding land-use data variables) increased classification accuracy by over 40 percent in relative terms 
(from 38.4 to 53.9 percent); however, just three selected ancillary variables alone accounted for a 
relative increase of over 30 percent. 

The USGS Circular 671 scheme, developed for high-altitude aircraft and satellite data inputs, worked 
reasonably well as a hierarchical classification system at both the first- and second-order levels in this 
supervised machine-interpretation effort. However, it appeared that much more detailed ancillary data 
will be needed for accurately interpreting the third-order urban classes. The inherent remaining problem 
in urban settings is the repetitive use of manmade building materials in a diversity of urban land uses. 
Fortunately, the USGS land-use classification system is predominantly land-cover oriented and is therefore 
useful for remote sensing discrimination. The ancillary data compensated, in large part, for the land-use 
versus land-cover confusion factor in the seven second-order urban classes but did not do well for the four 
third-order urban classes. Average classification accuracy was 59.8 percent for the second-order urban 
classes but only 12.6 percent for the third-order urban classes. Lastly, the fixed a priori classes of the 
USGS system that enable the results of its application to be intercompared from area to area pose a 
potential problem for the unsupervised clustering algorithms coming into general use (Reference 45). 

The average direct cost for the optimized model for 24-class land-use mapping of the Landsat scanner 
imagery and associated ancillary map overlays was $0.0468 per hectare ($0.0189 per acre). Based on these 
costs, the mapping of a 7.5-minute USGS quadrangle would be $704.19. These unit-area costs included 
extensive geometric correction preprocessing, ancillary data registration, feature extraction, classification, 
verification, and display operations. The production costs of employing this approach were $0.0128 per 
acre, $0.0317 per hectare, $8.21 per square statute mile, or $477.17 per 7.5-minute USGS quadrangle. 
Computer time was billed at $290 per hour, and personnel wages were set at $5 per man-hour. 

RELATED APPLICATIONS AREAS 

A considerable quantity of landscape data was assembled during this study. An increasingly important 
facet of land-use planning is facilities siting, particularly fossil fuel, hydroelectric, geothermal, and 
nuclear power plants. Thus, a new, comprehensive, land-use planning data base must encompass even more 
ecological, environmental, hydrological, land-use, and natural resource data. 

These long-term construction projects are enormously costly, heavily regulated by governmental agencies, 
scrutinized by citizen watchdog groups, and vulnerable to location errors. Inadequate data may result in 
structures of the wrong size or scope with consequent losses from destruction or damage, failure to take 
advantage of economic potential, or excess capacity (Reference 25). Consequently, these engineering 
projects are planned with consummate knowledge and care, and they provide an even more logical place 
to apply the landscape assessment and modeling methods developed here. 

More appropriate planning data and modeling of this type could be used to avoid agricultural investment 
where the climate is inappropriate, or to design dams, bridges, and irrigation works more in accord with 
the true characteristics of the site (Reference 25). 
Most important, however, these comprehensive data bases will permit full development of the computer 
modeling techniques necessary for predicting how the landscape will spatially evolve in response to various 
alternatives (References 5 and 46). The scenario simulated and evaluated could 
include diverse objectives, such as the control of shifting cultivation in underdeveloped countries, new 
zoning boundaries for urban land planning, alternate sites for a new power plant, or the environmental 
impact of siting a new dam. Land management in general, and land-use planning in particular, would be 
substantially improved if the future spatial impacts of a contemplated action could be accurately modeled 
before any commitment to a long-term, rigid course of action. 

RECOMMENDATIONS 


When this [illegible]-year study was completed, six recommendations appeared to be appropriate. The first three 
deal with ancillary data inputs, and the remainder deal with machine interpretation and processing. All six 
recommendations impact on the landscape modeling approach to land-use planning outlined in this study. 


Socioeconomic Census Data Densification 


The use of census tracts in preparing socioeconomic data planes produced reasonable results in both the 
spatial land-use projection modeling and the Landsat land-use/land-cover classification efforts. Unfortunately, 
although a 4-ha (10-acre) ancillary data mapping cell was superimposed, these data planes were not as highly 
resolved in a spatial sense as desired for urban planning applications. The use of the smaller enumeration 
districts for enumerating densely populated urban areas would improve the data resolution three- to 
five-fold. 


Topographic Elevation Data Digitization 

Topographic elevation has been shown to be a primary constituent of the landscape model. Derivative 
data planes included topographic slope, topographic aspect, and solar radiation or insolation. 
Experimentation with Defense Mapping Agency (DMA) digital terrain tapes showed glaring deficiencies in the 
interpolated 61-m (200-ft) contours (Reference [illegible]). Thus, the only alternative was to tediously 
hand-cellularize the topographic maps for the precise elevation data needed for this study. Considerable savings 
in both time and money could be realized if the digital terrain models created by the USGS in 
constructing the 1:24,000-scale, 7.5-minute topographic maps were saved on magnetic tape. 

Collateral Soils Data Inputs 


A soil survey is a detailed physical inventory of the soil, showing its depth, texture, structure, drainage, 
stoniness, slope, erosion, and other land features that are helpful in determining its use and capability. These 
highly useful data were not available for the Denver study area. Soil type is certainly one of the principal 
determinants of land use (Reference [illegible]). Its inclusion in the data base would be a useful addition to the 
ancillary physiographic submodel for predicting spatial land-use changes under given circumstances and 
for improving the Landsat image-classification results. 


Multidate Land-Use Change Detection 

Land-use change detection involves the comparison of remote sensing imagery or other inventory data for 
two different points in time. Change detection on a spatial basis provides important planning data in 
managing the various landscape components. The use of high-altitude or orbital remote sensing data 
provides a near-orthographic view with attendant synoptic large-area overview. These images must be in 
point geometric congruence if multiple images are to be processed in a digital computer and point 
classifications are to be made using statistical pattern recognition techniques. Thus, a principal advantage of 
Landsat digital imagery is that time-differing scenes can be transformed to usable congruence much more easily 
and at less cost per unit area than any type of aircraft imagery. It is therefore recommended that continued 
effort be expended to supply geometrically corrected data at various selected cell sizes. Because this need 
is common to all users, it should be centrally addressed for all data distributed. 

Symbiotic Use of Ancillary Map Data 

The use of remote sensing images in generating change-detection data can provide metropolitan areas with 
the necessary information for structuring viable plans for the guidance and development of the evolution 
of these areas. However, for maximizing potential utility, remote sensing should be used with ancillary 
data sources. Although spectral Landsat data alone has worked well on natural cover types, spatial map 
data has proved its value in urban settings in which a single land use encompassed a plethora of artificial 
cover materials. It appears, therefore, that any reliable machine interpretation system will have to embody 
spectral, spatial, and temporal discriminants to function well for all anticipated urban land uses and 
conditions. The basic input to these automatic techniques is, and will continue to be, spectral data. However, 
both spatial and multitemporal map data must be incorporated into the spectral data base on an increas- 
ingly larger basis for further improvement in classification accuracy. 

Future Information Systems Development 

Proper use of available data, as well as new methods of data collection and analysis, is required for defining 
and quantifying the capabilities and limitations of the landscape components for local or regional level 
control and management. Electronic data processing (EDP) provides extensive computational power for 
storing, manipulating, retrieving, displaying, and updating resource data (Reference 9). Automated resource 
information systems, which spatially identify resource data, increasingly appear to be the most feasible 
means of providing complete, objective, and consistent information and analyses (Reference 47). These 
systems will provide the basis for enlightened land-management decisions relative to the following questions: 

• What is the current land use? 

• How is the land use defined and evaluated? 

• What are its characteristics, dynamics, and limitations? 

• How much single- versus multiple-purpose land use is there? 

• What are the evolving socioeconomic spatial patterns? 
• How much open space exists and how is it preserved? 

• How can land-use data be efficiently collected and used? 


system for" kaX’ thHnveVt^^^^^^^^^ automated resource-information 
APPENDIX A 

MULTIPLE DISCRIMINANT ANALYSIS 


Multiple discriminant analysis finds the transform that yields the maximum ratio of the difference between 
group multivariate means to the multivariate variance within the groups. The statistical algorithm embodied 
in program CLASSIFY* (Appendix B) computes a discriminant function for each of the land-use classes by 
selecting the independent variables (the 48 Landsat image and ancillary map variables) in a stepwise fashion. 
The new variable entered at each step is selected on the basis of largest F-value to enter. An original set of 
observations on a picture element or cell, for example, is transformed into a single discriminant score by the 
discriminant function. The score represents the position of the picture element along the line defined by 
the linear-discriminant function. It is seen, then, that the discriminant function reduces a multivariate 
problem to a univariate one. 

The discriminant function is found by solving an equation of the form 

$$[s_p^2] \cdot [\lambda] = [D]$$

where $[s_p^2]$ is an m-by-m matrix of pooled variances and covariances of the m variables. The coefficients 
of the discriminant function are represented by a column vector of the unknown lambdas. Lowercase Greek 
lambdas ($\lambda$'s) are used by convention to represent the coefficients of the discriminant function. These are 
exactly the same as the betas ($\beta$'s) also used by convention in regression equations. These should not be 
confused with the lambdas used to represent eigenvalues in principal components or factor analyses, nor the 
lambdas used to represent wavelength in a remote sensing sense. 

The right-hand side of the equation consists of the column vector of m differences between the means of 
the two groups in the simple discriminant-analysis case. The equation can be solved by inversion and 
multiplication, such as 

$$[\lambda] = [s_p^2]^{-1} \cdot [D]$$

or by use of a simultaneous equation solution. 


The various entries in the matrix equation must be determined in order to compute the discriminant function. 
The mean differences are simply found as 

$$D_j = \bar{A}_j - \bar{B}_j = \frac{\sum_{i=1}^{n_a} A_{ij}}{n_a} - \frac{\sum_{i=1}^{n_b} B_{ij}}{n_b}$$
*CLASSIFY occurs in the LMS package and is the modified version of BMD07M, which is part of the UCLA biomedical statistical package 
available on most major computers (Reference 1). The stepwise linear-discriminant analysis remains unmodified; however, the input/output 
and internal control code was modified to handle the much larger data bases of remote sensing imagery. 


In this notation, $A_{ij}$ is the $i$th observation of variable $j$ in group A; $\bar{A}_j$ is the mean of variable $j$ in 
group A, or the average of $n_a$ observations. Similarly, these conventions apply to group B. The multivariate 
means of groups A and B can be regarded as forming two vectors. The difference between these multivariate 
means, therefore, also forms the vector 

$$[D_j] = [\bar{A}_j] - [\bar{B}_j]$$

A matrix of sums of squares and cross-products of all variables in group A, as well as a similar matrix for 
group B, is computed in order to construct the matrix of pooled variances and covariances. This is done by 
conventional means for group A as 

$$SPA_{jk} = \sum_{i=1}^{n_a} A_{ij} A_{ik} - \frac{\sum_{i=1}^{n_a} A_{ij} \sum_{i=1}^{n_a} A_{ik}}{n_a}$$

Here, $A_{ij}$ denotes the $i$th observation of variable $j$ in group A as before, and $A_{ik}$ denotes the $i$th 
observation of variable $k$ in the same group. Whenever $j = k$, this quantity becomes the sum of squares of 
variable $k$. Similarly, a matrix of sums of squares and cross-products is found for group B as 

$$SPB_{jk} = \sum_{i=1}^{n_b} B_{ij} B_{ik} - \frac{\sum_{i=1}^{n_b} B_{ij} \sum_{i=1}^{n_b} B_{ik}}{n_b}$$

The sums of products matrix from group A is denoted as $[SPA]$, and that from group B as $[SPB]$. The 
matrix of pooled variance is now found as 

$$[s_p^2] = \frac{[SPA] + [SPB]}{n_a + n_b - 2}$$


This equation for pooled variance is exactly the same as that used in T-tests for the equality of multivariate 
means. Although the number of calculations involved in deriving the coefficients of a discriminant function 
appears to be large, they are less formidable than at first glance. The set of $\lambda$ coefficients are entries in the 
discriminant-function equation of the form 

$$R = \lambda_1 \psi_1 + \lambda_2 \psi_2 + \cdots + \lambda_m \psi_m$$


This is a linear function whose terms are summed to yield a single number, the discriminant score. In this 
two-dimensional example, the discriminant function can be plotted as a line on the scatter diagram of the 
two original variables (figure 44 of the main text). It is a line through the plot whose slope, $a$, is 

$$a = \lambda_2 / \lambda_1$$
Substitution of the midpoint between the two group means in this equation yields the discriminant index, 
$R_0$. Specifically, for each value of $\psi_j$, the following terms are inserted: 

$$\psi_j = \frac{\bar{A}_j + \bar{B}_j}{2}$$

The discriminant index, $R_0$, is the point along the discriminant-function line that is exactly halfway between 
the center of group A and the center of group B. Next, the multivariate mean of group A can be substituted 
into the equation to obtain $R_A$ (that is, $\psi_j = \bar{A}_j$), and the mean of group B to obtain $R_B$ ($\psi_j = \bar{B}_j$). 
These define the centers of the two original groups along the discriminant function. For group A, 

$$R_A = \lambda_1 \bar{A}_1 + \lambda_2 \bar{A}_2$$

and for group B, 

$$R_B = \lambda_1 \bar{B}_1 + \lambda_2 \bar{B}_2$$


The members of group A that are located on the group B side of R© and the members of group B that are 
located on the group A side of R© are misclassified by the discriminant function. 
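The two-group procedure described in this appendix can be sketched in a few functions. This is a minimal illustration in plain Python, not the CLASSIFY/BMD07M code: it assumes two groups only, uses the usual pooled-variance divisor of n_a + n_b - 2, and solves the matrix equation by simple elimination. All function names and the eight sample observations are hypothetical.

```python
def column_means(group):
    """Mean of each variable (column) over the observations (rows) of one group."""
    n = len(group)
    return [sum(row[j] for row in group) / n for j in range(len(group[0]))]


def sums_of_products(group):
    """Corrected sums of squares and cross-products (SPA or SPB) for one group."""
    n, m = len(group), len(group[0])
    col_sum = [sum(row[j] for row in group) for j in range(m)]
    return [
        [sum(row[j] * row[k] for row in group) - col_sum[j] * col_sum[k] / n
         for k in range(m)]
        for j in range(m)
    ]


def solve(matrix, rhs):
    """Solve matrix * x = rhs by Gauss-Jordan elimination with partial pivoting."""
    m = len(rhs)
    aug = [list(matrix[i]) + [rhs[i]] for i in range(m)]
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(m):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][m] / aug[i][i] for i in range(m)]


def score(lambdas, observation):
    """Discriminant score R: the weighted sum of the observed variables."""
    return sum(l * x for l, x in zip(lambdas, observation))


def fit_discriminant(group_a, group_b):
    """Return (lambdas, mean_a, mean_b, r0) for a two-group linear discriminant."""
    na, nb = len(group_a), len(group_b)
    mean_a, mean_b = column_means(group_a), column_means(group_b)
    diff = [a - b for a, b in zip(mean_a, mean_b)]
    spa, spb = sums_of_products(group_a), sums_of_products(group_b)
    m = len(diff)
    pooled = [[(spa[j][k] + spb[j][k]) / (na + nb - 2) for k in range(m)]
              for j in range(m)]
    lambdas = solve(pooled, diff)
    midpoint = [(a + b) / 2 for a, b in zip(mean_a, mean_b)]
    return lambdas, mean_a, mean_b, score(lambdas, midpoint)


def classify(lambdas, mean_a, r0, observation):
    """Assign to group A if the score lies on the same side of R0 as the group A center."""
    same_side_as_a = (score(lambdas, observation) - r0) * (score(lambdas, mean_a) - r0) > 0
    return "A" if same_side_as_a else "B"


# Hypothetical two-variable training groups of four observations each.
group_a = [(1, 1), (3, 1), (1, 3), (3, 3)]
group_b = [(5, 4), (7, 4), (5, 6), (7, 6)]
lambdas, mean_a, mean_b, r0 = fit_discriminant(group_a, group_b)
labels = [classify(lambdas, mean_a, r0, obs) for obs in [(3, 3), (6, 4)]]
```

An observation whose score falls exactly on the discriminant index is assigned to group B in this sketch; that tie-breaking rule is an arbitrary choice made for the example.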


The significance of the separation between the two groups can be tested if certain assumptions are made 
regarding the nature of the data used in the discriminant function. These five basic test assumptions are: 

(1) The observations in each group are randomly chosen. 

(2) The probability of an unknown observation belonging to either group is equal. 

(3) The variables are normally distributed within each group. 

(4) The variance-covariance matrices of the groups are equal in size. 

(5) None of the observations used to calculate the function were misclassified. 

The most difficult assumptions to justify are (2), (3), and (4). However, the function is not seriously affected 
by limited departures from normality or by limited inequality of variances. The justification of (2) depends 
on an a priori assessment of the relative abundance of the groups under examination (Reference 2). 

A test tor the significance ot the discriminant function is developed from the T-statistic mentioned earlier. 

A “distance'’ measure between the two multivariate means can be calculated by simply subtracting R from 
R©. This is equivalent to substituting the vector of differences between the two group means into the dis- 
criminant equation, or setting the individual values of equal to 1)^. This distance measure is called 
Mahalanobis distance,” or generalized distance, l)^. it Is a measure of the separation between the two 
multivariate means expressed in units of the pooled variance. The T-test of this distance has the form 


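$$T^2 = \left[\frac{n_a n_b}{n_a + n_b}\right] D^2$$

The T-test can be transformed into an F-test, becoming

$$F = \left[\frac{n_a + n_b - m - 1}{(n_a + n_b - 2)\, m}\right] \left[\frac{n_a n_b}{n_a + n_b}\right] D^2$$

where $n_a$ and $n_b$ are the numbers of observations in groups A and B, and $m$ is the number of variables. The null hypothesis tested is

$$H_0: [D_j] = 0$$

against

$$H_1: [D_j] > 0$$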
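The two test statistics follow directly from the generalized distance. The short Python sketch below evaluates them for hypothetical inputs. The function name and the sample values are assumptions, and the degrees of freedom returned are the standard ones for this F-test, namely m and (n_a + n_b - m - 1).

```python
def t2_and_f(d2, n_a, n_b, m):
    """Return (T^2, F, degrees of freedom) for generalized distance d2.

    d2  : Mahalanobis generalized distance between the two group means
    n_a : number of observations in group A
    n_b : number of observations in group B
    m   : number of variables in the discriminant function
    """
    t2 = (n_a * n_b) / (n_a + n_b) * d2
    f = (n_a + n_b - m - 1) / ((n_a + n_b - 2) * m) * t2
    dof = (m, n_a + n_b - m - 1)
    return t2, f, dof


# Hypothetical example: two groups of four observations, two variables.
t2, f, dof = t2_and_f(d2=54.0, n_a=4, n_b=4, m=2)
```

The F value is then compared with a tabulated critical value for the returned degrees of freedom at the chosen significance level.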

The utility of this as a test of a discriminant function should be clear. If the means of the two groups are
very close together, it will be difficult to separate them, especially if both groups have large variances. On
the other hand, if the two means are well separated and scatter around the means is small, discrimination
is relatively easy.

Because discriminant analysis is closely related to multiple-regression analysis, procedures for selecting the
most effective set of predictors can also be used to find the most effective set of discriminators. For example,
the relative contribution of variable j to the distance between the two group means may be measured by a
quantity, $E_j$:

$$E_j = \frac{\lambda_j D_j}{D^2}$$

where $D_j$ is the difference between the $j$th means of the two groups. This is only one measure of the direct
contribution of the variable j, and it does not consider interactions between variables. If two or more of the
variables in the discriminant function are not independent, their interaction may contribute to $D^2$ to a
greater extent than their $E_j$ values suggest. The $E_j$ values are analogous to partial-regression coefficients
in multiple regression. Values of $E_j$ may be simply converted to percentages by multiplying by 100.
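The relative-contribution measure can be illustrated with a few lines of Python. This is a sketch under stated assumptions: the function name and the numerical values are hypothetical, and the generalized distance is taken as the sum of the products of each discriminant coefficient and the corresponding mean difference.

```python
def relative_contributions(lam, mean_a, mean_b):
    """Percentage contribution of each variable to the generalized distance.

    lam    : discriminant coefficients, one per variable
    mean_a : group A means, one per variable
    mean_b : group B means, one per variable
    """
    diffs = [a - b for a, b in zip(mean_a, mean_b)]
    # Generalized distance as the discriminant equation evaluated at the
    # vector of mean differences.
    d2 = sum(l * d for l, d in zip(lam, diffs))
    return [100.0 * l * d / d2 for l, d in zip(lam, diffs)]


# Hypothetical coefficients and group means for two variables.
percentages = relative_contributions([2.0, 0.5], [4.0, 6.0], [1.0, 2.0])
```

Because the individual products sum to the generalized distance, the percentages sum to 100. They describe direct contributions only and say nothing about interactions between correlated variables.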

references 
APPENDIX B

LANDSAT MAPPING SYSTEM 


The Landsat Mapping System (LMS) represents a total rewrite of the RECOG or RECOGnition Mapping
System (References 1 through 3).* RECOG was intended principally for instruction at Colorado State
University, and the new LMS software is compatible with it. However, the new system is specifically structured for
both Landsat imagery inputs and composite mapping. Advantages cited for LMS include economy of
operation, flexibility, exportability to other CDC computers, and high-volume production (Reference 4).

The LMS software is subdivided into four major image-processing steps. The first step prepares geometrically
rectified digital scenes in a given scale or picture-element ground area with Landsat computer-compatible
tape inputs (figure B-1). The second step interleaves multitemporal Landsat images, with the added option
of overlaying multiple ancillary data planes derived from maps onto the multidate/multitemporal spectral
data base (figure B-2). The third step performs feature extraction computation and optimization of the
mapping materials' statistical signatures (figure B-3). The fourth and final step maps the spatial distribution
of each desired material as rectified and scaled displays (figure B-4).

The computer cost of LMS for preparing a single classification map of a 1:24,000-scale quadrangle was
estimated at $200. (See table B-1.) The LMS cost example involved three multidate images (twelve bands
of multispectral data), but no ancillary data were involved. Significant additional economies of scale were
realized in the application of many of these LMS modules to the larger-scale Denver land-use/land-cover
inventory effort reported here. Forty-eight Landsat image and ancillary map variables were overlaid,
classified, and displayed for an average cost of $704.19 per 7.5-minute USGS quadrangle.
UP TO 4 TAPES, REPRESENTING ~2B MILE E-W SEGMENTS OF A GIVEN LANDSAT IMAGE,
MAY BE INPUT SIMULTANEOUSLY.

"CONVERTS" THE LANDSAT FORMAT TAPE(S) INTO THE INTERNAL, SINGLE RECOG TAPE.
ONLY THE PORTION OF THE IMAGE NEEDED TO OVERLAY THE SELECTED MAP IS CONVERTED
AND POOLED TOGETHER.

"ROTATE" RESAMPLES THE ORIGINAL IMAGE CELLS TO REPRESENT ANY SIZE RECTANGULAR
OR SQUARE CELL AS SELECTED BY THE USER. ADJUSTS FOR ORIGINAL IMAGE DISTORTIONS.
SCALES IMAGE TO MAP SCALE (E.G., 1:24,000).

"FILTERS" THE IMAGE.

"DISPLAYS" 1, 2, 3 ... OR ALL OF THE INDIVIDUAL SPECTRAL BANDS IN THE ORIGINAL
OR MAP OVERLAY FORMAT. DISPLAY OPTIONS INCLUDE LINEPRINTER AND MICROFILM GRAYMAPS.

"LANDSAT" COMPUTER COMPATIBLE TAPE (CCT) AS SUPPLIED BY EROS DATA CENTER.

"RECOG" FORMATTED TAPE (OR DISK) FILE - A STANDARD FORMAT TAPE USED THROUGHOUT
THE IMAGE PROCESSING ACTIVITY. (n = 1 to 4)

FIGURE B-1. STEP 1: IMAGE PREPARATION/MAP OVERLAY.













UP TO 10 RECOG FORMATTED TAPES OF A VARYING NUMBER OF SPECTRAL BANDS ARE INPUT.

"TRIMS" EACH RECOG FORMATTED TAPE (OR FILE) TO A SELECTED NUMBER OF LINES AND
COLUMNS DESIGNATED BY THE USER, USUALLY THOSE NEEDED TO COVER MAP SELECTED.
LINES AND COLUMNS ARE RENUMBERED, BEGINNING AT 1, 1.

"COMBINES" RECOG FORMATTED DATA FROM THE 1 TO 10 SEPARATE INPUT TAPES (FILES)
INTO 1 COMPOSITE RECOG TAPE (FILE) REPRESENTING A MULTIDATE, MULTISPECTRAL IMAGE.

"DISPLAYS" 1, 2, 3 ... OR ALL OF THE INDIVIDUAL SPECTRAL BANDS IN COMBINED IMAGE.
DISPLAY OPTIONS INCLUDE LINEPRINTER AND MICROFILM GRAYMAPS.

(i + j are any integers)

STEP 2 AUXILIARY PROGRAMS.

"ANCILLARY" CREATES RECOG FORMATTED DATA FROM CELLULARIZED MAP DATA PLANES INPUT
IN CARD OR MAGNETIC TAPE FORMAT. MAP CELLS MUST BE THE SAME SIZE OR SOME INTEGER
MULTIPLE OF THE CELLS ON THE RECOG FORMATTED DATA WITH WHICH THE ANCILLARY DATA
WILL BE COMBINED.

FIGURE B-2. STEP 2: INTERLEAVES IMAGES FROM VARIOUS DATES.








"EXTRACTS" THE TRAINING FIELD DATA IDENTIFIED BY THE USER (RECTANGLES, IRREGULAR
AREAS, AND POINTS) FROM THE RECOG IMAGE FORMAT.

"TRANSFORMS" THE TRAINING FIELD DATA. FORMS RATIOS OF SPECIFIED SPECTRAL BANDS,
USES ELEVATION OVERLAYS TO ADJUST SPECTRAL BANDS FOR TERRAIN SHADOWING, ETC.

"CLEANS" OUT TRAINING FIELD DATA POINTS WITH LOW PROBABILITY OF BEING THE SELECTED
MATERIAL OR HIGH PROBABILITY OF BEING SOME OTHER MATERIAL, ETC.

"GROUPS" TRAINING SETS TOGETHER WHICH WERE ORIGINALLY SELECTED IN EXTRACT TO
REPRESENT SEPARATE MATERIALS BUT ARE NOW DETERMINED TO BE STATISTICALLY SIMILAR.

"CLASSIFIES" THE TRAINING FIELDS USING MAXIMUM-LIKELIHOOD APPROACH (STEPWISE
DISCRIMINANT ANALYSIS). OTHER DECISION RULES CAN BE SUBSTITUTED HERE.

"OVERLAYS" ANY VARIABLE OR RESULT IN POINT FILE INTO A RECOG FORMAT FOR DISPLAY
AND MAP OVERLAY.

"SIGNATURES" COMPUTES STATISTICAL REPRESENTATION OF EACH MATERIAL SPECIFIED BY THE
USER FOR USE IN MAPPING THESE MATERIALS ON ANY DATA TAKEN FROM THE SAME ORIGINAL
IMAGE.

"PRINTS" OR "PUNCHES" OUT ANY VARIABLE(S) IN THE POINT FILE FOR FURTHER ANALYSIS IN
ADDITIONAL PROGRAMS WRITTEN BY THE USER.

(i is any integer)

"POINT" BY POINT TAPE (OR DISK) FILE - AN INTERNAL TAPE, DISK, AND/OR CARD FILE
FORMAT WHICH CONTAINS ONLY THE EXTRACTED TRAINING FIELD DATA AND DOES NOT MAINTAIN
ITS CORRECT MAP OVERLAY POSITION.

FIGURE B-3. STEP 3: COMPUTES STATISTICAL "SIGNATURES" OF MATERIALS TO BE MAPPED.













"TRANSFORMS" DATA FOR EACH IMAGE CELL AS TESTED AND SELECTED IN STEP 3.

"MAPS" OUT THE DISTRIBUTION OF EACH SURFACE MATERIAL SPECIFIED BY THE USER.

"DISPLAYS" THE SELECTED IDENTIFICATION OF EACH IMAGE CELL AND/OR PROBABILITY THAT
IT IS THE MATERIAL DESIGNATED. DISPLAY OPTIONS INCLUDE LINEPRINTER AND MICROFILM
GRAYMAPS AND LINEPRINTER COLOR SYMBOL MAPS.

(I and k are any integers)

STEP 4 AUXILIARY PROGRAM.

"ZOOMS" OR ENLARGES THE RECOG FORMATTED TAPE (OR FILE) BY ECHOING EACH IMAGE CELL
"N" TIMES ON A LINE AND REPEATING EACH LINE "M" TIMES.

FIGURE B-4. STEP 4: MAPS DISTRIBUTION OF EACH MATERIAL.
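The enlargement performed by the ZOOM auxiliary program can be sketched in Python. This is an illustration only: the function name and the list-of-lists image layout are assumptions, and the original program operated on RECOG formatted tapes or files rather than on in-memory lists.

```python
def zoom(image, n, m):
    """Echo each image cell n times on a line and repeat each line m times.

    image : list of lines, each line a list of cell values
    n     : horizontal repetition factor
    m     : vertical repetition factor
    """
    enlarged = []
    for line in image:
        # Echo every cell n times along the line.
        wide = [cell for cell in line for _ in range(n)]
        # Repeat the widened line m times (each copy is a separate list).
        enlarged.extend(list(wide) for _ in range(m))
    return enlarged


# Hypothetical 2 x 2 classified image enlarged by a factor of 2 each way.
zoomed = zoom([[1, 2], [3, 4]], n=2, m=2)
```

Choosing different values for the two factors allows the displayed cells to be stretched unequally, for example to offset the non-square character spacing of a lineprinter.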







TABLE B-1

COST ESTIMATES FOR OPERATING THE LANDSAT MAPPING SYSTEM

Item                                                                        Cost Estimate*

CONVERT                                                                     $5/date
ROTATE                                                                      $7/date
FILTER                                                                      $6/date
DISPLAY       $1/band/date X 2 bands =                                      $2/date
Step 1                                                                      $20/date
              Assuming three dates involved gives $20/date X 3 =            $60

TRIM                                                                        $3/date
COMBINE       3 dates combined =                                            $1
DISPLAY       $1/band/date X 1 band =                                       $1
ANCILLARY                                                                   optional
Step 2        Assuming three dates gives $3/date X 3 dates + $1 + $1 =      $11

EXTRACT                                                                     $10 (approx.)
TRANSFORM                                                                   $5 (approx.)
CLEAN         $2/iteration X 3 iterations =                                 $6 (approx.)
CLASSIFY      $8/iteration X 3 iterations =                                 $24 (approx.)
SIGNATURES                                                                  $2 (approx.)
OVERLAY                                                                     optional
GROUP                                                                       optional
PRINT/PUNCH                                                                 optional
Step 3        Based on 2,000 points =                                       $50 (approx.)

TRANSFORM                                                                   $5 (approx.)
MAP           Based on mapping 30 material types                            $73 (approx.)
DISPLAY       Black-and-white lineprinter symbol map                        $1 (approx.)
ZOOM                                                                        optional
Step 4        Based on 30 classes mapped =                                  $79 (approx.)

STEP TOTAL*                                                                 $200 (approx.)

*Estimated computer costs for one 1:24,000 quad map with ~1-acre cells, three dates (12 spectral
bands), 2,000 cells defining training fields, 30 material types, and black-and-white lineprinter display.
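The step totals in table B-1 can be checked with simple arithmetic. The Python sketch below restates the table entries. The variable names are assumptions, and the Step 3 figure of $50 is the rounded value given in the table rather than the exact sum of its items.

```python
DATES = 3  # number of Landsat dates in the cost example

# Step 1: CONVERT + ROTATE + FILTER + DISPLAY ($1/band/date x 2 bands).
step1_per_date = 5 + 7 + 6 + 1 * 2
step1 = step1_per_date * DATES

# Step 2: TRIM ($3/date) + COMBINE ($1) + DISPLAY ($1/band/date x 1 band).
step2 = 3 * DATES + 1 + 1 * 1

# Step 3: EXTRACT + TRANSFORM + CLEAN (3 iterations) + CLASSIFY (3 iterations)
# + SIGNATURES. The items sum to 47; the table rounds this to about 50.
step3_items = 10 + 5 + 2 * 3 + 8 * 3 + 2
step3 = 50

# Step 4: TRANSFORM + MAP + DISPLAY.
step4 = 5 + 73 + 1

total = step1 + step2 + step3 + step4
```

Only Step 1 and the TRIM program scale with the number of dates, so adding dates raises the total much less than proportionally.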
APPENDIX C

MACHINE CLASSIFICATION ERROR-RATE ESTIMATION


rationals have been explored In this endeavor. * * ° utility of the classifler. Both 

appro^^h '! parametric model. However, this 

overoptimistic because characteristics that make th * estimates of this nature ate almost always 

«vealed. Second, the v^d^^Tan^ulTf !,T‘l untepresentative will not b" 

this is «'"srallyTTC%'^iualtog «l»|imei«al t«tlng of the classifler. In practice, 

that anr mflclashfled as an estate of the e«or “e osl"oTm“ 1 “"f “ 

'Sr ■>">«“ •» • 

minimum achievable error mtrHo^ver the 

sacrificed for the advantages of worWng sdely ^ SmeVioa 

Two types of error are prevalent in any multiclass yes-no decision problem. These occur when a picture
element is not designated a particular class when that class is correct, or when a picture element is
designated a particular class when it is not and some other class is correct. These are called type I and
type II errors and are referred to as omission and commission errors, respectively, in pattern-recognition
literature. The two states of the test hypothesis in a multiclass situation, along with the two decisions the
classifier can make, are indicated in a two-way decision table (table C-1).


TABLE C-1

                      Test Hypothesis
Decision      True                         False

Reject        Omission (type I) error      Correct decision
Accept        Correct decision             Commission (type II) error
TABLE C-2

CLASSIFICATION MATRIX USING FINAL TEN-VARIABLE LINEAR-DISCRIMINANT
ANALYSIS. The verified omission/commission errors of the machine-processed single-date Landsat-1 image are given. The
tabulations are based on identifying 24 second- and third-order land uses, and checking only at the first order on a point-to-
point comparison with the 1972-1973 USGS land-use data plane. Land-use codes are identified in table 1 of the main text.
Correctly classified class totals are italicized (shown here between asterisks). Verification accuracies are summarized in table 24 of the main text.

USGS                            Number of Picture Elements                               Omission Errors      Commission Errors
Land-Use                                                                                    (type I)              (type II)
Classes          Urban   Agriculture    Range    Forest    Water    Barren      Total     Total   Percent      Total   Percent

Urban         *180,781*     13,563      6,287      175     1,292      465     202,563    21,782    10.75      35,383    16.37
Agriculture     29,299     *46,619*     5,296        0       826      157      82,197    35,578    43.28      20,663    30.71
Range            4,005       5,968    *15,788*     307        53    1,923      28,044    12,256    43.70      16,683    51.38
Forest             246         462        780   *3,328*        2      204       5,022     1,694    33.73         906    21.40
Water            1,830         667         72        0    *4,190*       0       6,759     2,569    38.01       2,173    34.15
Barren               3           3      4,248      424         0   *2,513*      7,191     4,678    65.05       2,749    52.24

Total
Elements       216,164      67,282     32,471    4,234     6,363    5,262     331,776    78,557    23.68      78,557    23.68


Classification matrices for the ten-variable image analysis of the 576- by 576-element August 15, 1973,
Landsat scene of the Denver Metropolitan Area were generated for the six first-order classes (table C-2) and
for the 24 second- and third-order categories (table C-3).
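The omission and commission figures of such a matrix follow from its row and column totals. The Python sketch below illustrates the bookkeeping for the six first-order classes. The function names are assumptions, and the cell counts are the table C-2 values as reconstructed here from its row totals, column totals, and error totals, so they should be checked against the printed table before reuse.

```python
CLASSES = ["Urban", "Agriculture", "Range", "Forest", "Water", "Barren"]

# Rows: USGS reference class. Columns: machine-assigned class.
MATRIX = [
    [180781, 13563, 6287, 175, 1292, 465],
    [29299, 46619, 5296, 0, 826, 157],
    [4005, 5968, 15788, 307, 53, 1923],
    [246, 462, 780, 3328, 2, 204],
    [1830, 667, 72, 0, 4190, 0],
    [3, 3, 4248, 424, 0, 2513],
]


def error_rates(matrix):
    """Return one (omission, omission %, commission, commission %) per class."""
    n = len(matrix)
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(matrix[i][j] for i in range(n)) for j in range(n)]
    rates = []
    for k in range(n):
        correct = matrix[k][k]
        omission = row_totals[k] - correct      # reference cells missed
        commission = col_totals[k] - correct    # cells wrongly assigned
        rates.append((omission, 100.0 * omission / row_totals[k],
                      commission, 100.0 * commission / col_totals[k]))
    return rates


def overall_accuracy(matrix):
    """Percentage of all picture elements on the matrix diagonal."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[k][k] for k in range(len(matrix)))
    return 100.0 * correct / total


RATES = error_rates(MATRIX)
```

Every omission error for one class is a commission error for another, which is why the two error totals for the whole matrix are equal.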
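TABLE C-3

COMPOSITE SECOND- AND THIRD-ORDER LAND-USE CLASSIFICATION MATRIX USING FINAL TEN-VARIABLE
LINEAR-DISCRIMINANT ANALYSIS. The verified omission/commission errors of the machine-processed, single-date
Landsat-1 image are given. The tabulations are based on identifying 24 second- and third-order land uses on a point-to-point
comparison with the 1972-1973 USGS land-use data plane. Land-use codes are identified in table 1 of the main text.
Correctly classified class totals are italicized. Verification accuracies are summarized in table 24 of the main text.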


uses 

Land-Use 

Classes 

Number of 

11 

12 

121 

13 

14 

15 

151 

16 

19 

191 

192 21 

11 

83,761 

2,140 

1,428 

1,670 

25 

783 

0 

378 

2,102 

3 

0 1,< 


6,891 

3,238 50 

1,544 

7 

132 

0 

299 

496 

1 

0 


6jl29 

181 2,073 156 

24 

36 

0 

185 

586 

0 

0 ; 


3,797 

949 

37 4,997 

12 

606 

0 

610 

350 

6 

0 

14 

1,529 

119 

34 

1,119 


9 

0 

13 

269 

2 

0 , 

15 

442 

199 

7 

405 

30 

2,685 

0 

950 

4 

10 

0 ; 

151 

486 

35 

3 

177 

1 

133 0. 87 

35 

1 

0 

16 

8,316 

498 

196 

646 

12 

36 

0 14,968 

279 

20 

0 ! 

19 

15,027 

266 

327 

971 

9 

792 

0 

585 

'l,642 

0 

0 \ 

191 

61 

6 

0 

37 

1 

0 

0 

0 

^0 

0 

192 

1,192 

3 

357 

16 

0 

0 

0 

0 

46 

0 0^ 

211 

8,637 

45 

1,665 

657 

14 

2,151 

0 

1,160 

1,89(5 


0 18,1 

212 

20 

0 

25 

0 

0 

0 

0 

0 

0 

0 

0 

213 

7,841 

17 

910 

148 

26 

1,854 

0 

1,278 

959 

0 

0 6,. 

31 

1,630 

21 

70 

7 

10 

864 

0 

1,107 

276 

0 

0 

33 

15 

0 

2 

1 

2 

0 

0 

0 

0 

0 

0 

411 

141 

0 

19 

21 

0 

0 

0 

54 

11 

0 

0 

421 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

422 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

51 

78 

30 

1 

83 

0 

0 

0 

11 

0 

0 

0 

52 

780 

66 

64 

239 

10 

9 

0 

151 

29 

0 

0 

53 

153 

2 

1 

0 

16 

0 

0 

84 

23 

0 

0 

74 

3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

741 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Total 

Elements 

146,929 

7,815 

7,269 

12,894 

205 

10,090 

0 

21,920 

8,997 

45 

0 30,1 
TABLE C-3 (continued)

COMPOSITE SECOND- AND THIRD-ORDER LAND-USE CLASSIFICATION MATRIX USING FINAL TEN-VARIABLE
LINEAR-DISCRIMINANT ANALYSIS. Land-use codes are identified in table 1 of the main text. Verification
accuracies are summarized in table 24 of the main text.


Picture Elements 

Total 

Omission 
Errors (Type 1) 

Commission 
Errors (Type II) 

1 

212 

213 

31 

33 

411 

421 

42: 

51 

52 

53 

74 

741 

Elements 

Total 

Percent 

Total 

Percent 

401 

0 

3,337 

1,100 

117 

0 

87 

0 

26 

235 

21 

51 

11 

98,676 

14,915 

15.12 

63,168 

42.99 

11 

0 

109 

8 

0 

0 

0 

0 

8 

24 

7 

0 

0 

12,825 

9,587 

74.75 

4,577 

58.57 

?S6 

8 

2348 

364 

154 

0 

46 

0 

2 

176 

58 

119 

112 

13,113 

11,040 

84.19 

5,196 

71.48 

311 

0 

155 

98 

18. 

0 

0 

0 

1 

65 

1 

8 

9 

12..330 

7,333- 

59.47 

7,897 

61.25 

>00 

3 

372 

480 

45 

0 

42 

0 

8 

177 

51 

9 

37 

4,824 

4,818 

99.88 

199 

97.07 

282 

0 

702 

140 

17 

0 

0 

0 

11 

10 

9 

19 

0 

5,922 

3,237 

54.66 

7,405 

73.39 

51 

0 

30 

36 

0 

0 

0 

0 

0 

41 

0 

0 

0 

1,116 

1,116 

100.00 

0 

0.00 

155 

0 

788 

3,058 

108 

0 

0 

0 

102 

74 

3 

28 

26 

21,920 

14,345 

48.94 

6,952 

31.72 

Ml 

0 

1,145 

343 

9 

0 

0 

0 

10 

125 

11 

35 

1 

22,239 

20,597 

92.62 

7,355 

81.75 

5 

5 

1 

117 

0 

0 

0 

0 

0 

1 

0 

0 

0 

234 

234 

100.00 

45 

100.00 

89 

0 

158 

75 

0 

0 

0 

0 

0 

34 

1 

0 

0 

1,971 

• 1,971 

100.00 

0 

0.00 

228^ 

4.327 

6,436 

1,266 

la 

0 

0 

0 

88 

294 

48 

0 

0 

46,926 

28,698 

61.16 

11,796 

39.29 

207 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

729 

252 

34.57 

4,984 

91.27 

459 

635 

9.8S0- 

-3,862 

150 

0 

0 

0 

28 

305 

63 

157 

0 

34,542 

24,692 

71.48 

21,947 

69.02 

421 

0 

5.646 

1 8,876 

307 

0 

0 

0 

0 

27 

26 

562 

531 

25,281 

11,405 

45.11 

13,618 

49.53 

0 

0 

1 

47^ 

U29 

0 

307 

0 

0 

0 

0 

204 

626 

2,763 

1,634 

59.14 

3,848 

77.32 

121 

6 

335 

100 


■^7 0 

0 

0 

2 

0 

0 

0 

810 

810 

100.00 

0 

0.00 

0 

0 

0 

22 

572 

0 

S,J09 

0 

0 

0 

0 

12 

153 

4,068 

759 

18.66 

917 

21.70 

0 

0 

0 

0 

86 

0 

15 

"4. 


0 

0 

0 

0 

144 

140 

97.22 

4 

50.00 

69 

0 

0 

0 

0 

0 

0 

0 

15 

0 

0 

0 

288 

287 

99.65 

286 

99.65 

:362 

0 

337 

66 

0 

0 

0 

0 

2 2,778 ^ 

262 

0 

0 

4,950 

2,177 

43.98 

1,859 

40.13 

5S 

0 

44 

6 

0 

0 

0 

0 

0 

254 888.^ 

0 

0 

1,521 

638 

41.94 

561 

38.85 

0 

0 

3 

633 

729 

0 

64 

1 

0 

0 

0 749 

128 

1,710 

1„561 

91.29 

1,328 

89.91 

0 

0 

0 

1.368 

1,518 

0 

356 

3 

0 

0 

0 

124 2./ /2 

5,481 

3,369 

61.47 

1,673 

44.20 

324 : 

5,461 31,797 

27.494 

4,977 

0 4,226 

8 

287 4,632 

1.444 

1,477 

3.785 

331,776 

165,615 

49,92 

165,615 

49,92 


49.92 
APPENDIX D

MAXIMUM-LIKELIHOOD RATIO


The maximum-likelihood ratio classifier, such as the classification subroutine CJLIKK in CSU's RECOG system
(Reference 1), assumes that the data are multivariate normally distributed. It utilizes a Bayesian decision
rule of the form

$$p(X \mid i) = (2\pi)^{-n/2}\, |\Sigma_i|^{-1/2} \exp\left[-\tfrac{1}{2}\,(X - A_i)'\, \Sigma_i^{-1}\, (X - A_i)\right]$$

where

$p(X \mid i)$ = the likelihood of occurrence of the feature vector, X, if it belongs to the $i$th class
$\Sigma_i$ = the covariance matrix for the $i$th class
$|\Sigma_i|$ = the determinant of $\Sigma_i$
$\Sigma_i^{-1}$ = the inverse of $\Sigma_i$
$A_i$ = the mean feature vector for the $i$th class
$n$ = the number of variables in the feature vector

These conditional probabilities for classes are taken two at a time, ratioed, and evaluated to assign each
picture-element vector to the class for which the likelihood, $p(X \mid i)$, of the unknown vector is the highest, or

$$g_i(X) = p(X \mid i)$$

The class probabilities, $p(i)$, are assumed to be equal in the maximum-likelihood ratio (Reference 2). However,
the modified maximum-likelihood ratio can exploit a priori class probabilities by taking

$$g_i(X) = p(i)\, p(X \mid i)$$

The benefits of having a priori class probabilities versus equal class probabilities were significant, as pointed
out by comparative tests using the 4,100-point training set (table 22 of the main text).
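The difference between the equal-probability and modified decision rules can be sketched in Python. This is a two-variable illustration, not a reproduction of the RECOG subroutine. The function names, the class signatures, and the prior probabilities are all hypothetical.

```python
import math


def gaussian_likelihood(x, mean, cov):
    """Bivariate normal likelihood p(X|i) for a 2x2 covariance matrix."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    inv = [[d / det, -b / det], [-c / det, a / det]]
    dx = [x[0] - mean[0], x[1] - mean[1]]
    # Quadratic form (X - A)' * inverse covariance * (X - A).
    quad = (dx[0] * (inv[0][0] * dx[0] + inv[0][1] * dx[1])
            + dx[1] * (inv[1][0] * dx[0] + inv[1][1] * dx[1]))
    # For n = 2 variables the constant is 1 / (2 * pi * sqrt(det)).
    return math.exp(-0.5 * quad) / (2.0 * math.pi * math.sqrt(det))


def classify(x, signatures, priors=None):
    """Assign x to the class with the largest g_i(X).

    With priors=None all classes are treated as equally probable;
    otherwise g_i(X) = p(i) * p(X|i).
    """
    best_label, best_g = None, -1.0
    for label, (mean, cov) in signatures.items():
        prior = 1.0 if priors is None else priors[label]
        g = prior * gaussian_likelihood(x, mean, cov)
        if g > best_g:
            best_label, best_g = label, g
    return best_label


# Hypothetical class signatures: (mean vector, covariance matrix).
IDENTITY = ((1.0, 0.0), (0.0, 1.0))
SIGNATURES = {"urban": ((0.0, 0.0), IDENTITY), "range": ((2.0, 0.0), IDENTITY)}
PRIORS = {"urban": 0.9, "range": 0.1}

cell = (1.2, 0.0)
equal_choice = classify(cell, SIGNATURES)
prior_choice = classify(cell, SIGNATURES, PRIORS)
```

In this example the cell lies slightly nearer the second class mean, so the equal-probability rule assigns it there, while the large a priori probability of the first class reverses the decision under the modified rule.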


APPENDIX E

CORRELATION MATRICES FOR
RECTANGULAR/POINT-SAMPLED TRAINING SETS

“ r " v-rtanoe-covaHanc. 


crea.es possibilities for dimensionality' .SucSot'''' “ “<■ 

(table E^) and the ten MSS bands/ratios rtL^a p f matrices for the four original MSS bands 

as well . forme «nal ten:!.S.f'rar:a^l^ 

Lmidsat/ancilla^ .aHable scene. TTe sire of .be dutiable 


TABLE E-1

WITHIN-GROUPS CORRELATION MATRIX FOR THE FOUR ORIGINAL MSS
BANDS DERIVED FROM RECTANGULAR TRAINING SETS. The rectangular training sets
later proved to be unrepresentative of the larger scene. Only the lower diagonal of the
matrix is shown because the upper and lower diagonals are symmetrical. The Landsat-1
image variables were added in a free stepwise fashion. The entry order is indicated by
superscript numbers on the variable names.

                    Bands
Bands       4        5        6        7

4         1.000
5         0.891    1.000
6         0.457    0.423    1.000
7         0.255    0.201    0.635    1.000







TABLE E-2

WITHIN-GROUPS CORRELATION MATRIX FOR THE FOUR ORIGINAL MSS BANDS AND SIX RATIOS DERIVED
FROM RECTANGULAR TRAINING SETS. This set of variables correctly classified 67.30 percent of the 2,413 total
samples, but the rectangular training sets later proved to be nonrepresentative of the larger scene. Only the lower diagonal
of the matrix is shown because the upper and lower diagonals are symmetrical. The Landsat-1 image variables were added in
a free stepwise fashion. The entry order is indicated by superscript numbers on the variable names.


Bands/ Ratios 


Bands/ 

Ratios 

4* 

5‘» 

6’ 

7' 

7/6^ 

7/5'“ 

7/4’ 

5/4* 

5/6« 

4 

1.000 









5 

0.891 

1. 000 








6 

0.456 

0.423 

1.000 







7 

0.255 

0.201 

0.635 

1.000 






7/6 

-0.195 

-0.233 

-0.179 

0.452 

1.000 





7/5 

-0.289 

-0.418 

0.179 

0.507 

0.394 

1.000 




7/4 

-0.377 

-0.382 

0,301 

0.731 

0.517 

0.732 

1.000 



5/4 

0.368 

0.719 

0.137 

-0.040 

-0.193 

-0.514 

-0.302 



5/6 

0.249 

0.358 

-0.365 

-0.238 

-0.082 

-0.336 

-0.374 



6/4 

-0.320 

-0.297 

0.642 

0.496 

0.027 


0.755 
L- 


-0.578 


TABLE E-3

WITHIN-GROUPS CORRELATION MATRIX FOR THE OPTIMAL SET OF THREE MSS SPECTRAL BANDS AND FOUR
LANDSCAPE VARIABLES DERIVED FROM RECTANGULAR TRAINING SETS. This set of variables correctly classified
96.64 percent of the 2,413 total samples, but the rectangular training sets later proved to be unrepresentative of the larger
scene. Only the lower diagonal of the matrix is shown because the upper and lower diagonals are symmetrical. The Landsat-1
image variables were forced in the predetermined order (table 13 of the main text), and the ancillary landscape variables were
added in a free stepwise fashion. The entry order is indicated by superscript numbers on the variable names. ACRES = total
census tract acreage; ELEVS = topographic elevation; INTER = freeway interchange minimum distances; and URBAN = built-
up urban-area minimum distances.


Variables 


Variables 


MSS-4^ 


MSS-5^ 


MSS-7' 


ACRES^ 


ELEVS'" 


INTER’ 


URBAN* 


MSS-4 

MSS-5 

MSS-7 

ACRES 

ELEVS 

INTER 

URBAN 


1.000 

0.891 

0.255 

-0.035 

-0.032 

0.100 

0.02b 


1.000 

0.201 

-0.034 


1.000 

- 0.011 

- 0.001 

0.042 


1.000 

0.057 

0.396 

0.508 


1.000 

-0.089 


1.000 


I 

] 

( 

j 


0.015 

0.111 

0.049 


0.017 


0.014 


1.000 

















TABLE E-4

WITHIN-GROUPS CORRELATION MATRIX FOR THE FOUR ORIGINAL MSS
BANDS DERIVED FROM GRID-SAMPLED TRAINING POINTS. This set of variables
correctly classified 37.90 percent of the 4,100 total samples. Only the lower
diagonal of the matrix is shown because the upper and lower diagonals are symmetrical.
The Landsat-1 image variables were added in a free stepwise fashion. The entry order
is indicated by superscript numbers on the variable names.

                    Bands
Bands       4        5        6        7

4         1.000
5         0.914    1.000
6         0.492    0.439    1.000
7         0.169    0.101    0.770    1.000
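Because only the lower diagonal of each matrix is tabulated, it can be convenient to expand it into the full symmetric matrix before further use. The Python sketch below does this for the four-band, grid-sampled correlations of table E-4 and picks out the most strongly correlated pair of bands. The function names are assumptions introduced for illustration.

```python
BANDS = ["4", "5", "6", "7"]

# Lower diagonal of the within-groups correlation matrix (table E-4).
LOWER = [
    [1.000],
    [0.914, 1.000],
    [0.492, 0.439, 1.000],
    [0.169, 0.101, 0.770, 1.000],
]


def full_symmetric(lower):
    """Mirror a lower-diagonal matrix into a full symmetric matrix."""
    n = len(lower)
    full = [[0.0] * n for _ in range(n)]
    for i, row in enumerate(lower):
        for j, value in enumerate(row):
            full[i][j] = value
            full[j][i] = value
    return full


def strongest_pair(full, names):
    """Return (name, name, r) for the largest off-diagonal |r|."""
    best = None
    for i in range(len(full)):
        for j in range(i):
            if best is None or abs(full[i][j]) > abs(best[2]):
                best = (names[j], names[i], full[i][j])
    return best


FULL = full_symmetric(LOWER)
PAIR = strongest_pair(FULL, BANDS)
```

A high correlation between two bands suggests that they carry largely redundant information, which is one reason a subset of bands can classify nearly as well as the full set.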


TABLE E-5

WITHIN-GROUPS CORRELATION MATRIX FOR THE FOUR ORIGINAL MSS BANDS AND SIX RATIOS DERIVED
FROM GRID-SAMPLED TRAINING POINTS. This set of variables correctly classified 38.41 percent of the 4,100 total
points. Only the lower diagonal of the matrix is shown because the upper and lower diagonals are symmetrical. The Landsat-1
image variables were added in a free stepwise fashion. The entry order is indicated by superscript numbers on the variable
names.

Bands/                                      Bands/Ratios
Ratios       4        5        6        7       7/6      7/5      7/4      5/4      5/6      6/4

4          1.000
5          0.914    1.000
6          0.492    0.439    1.000
7          0.169    0.101    0.770    1.000
7/6       -0.172   -0.180    0.105    0.256    1.000
7/5       -0.530   -0.626    0.233    0.626    0.305    1.000
7/4       -0.482   -0.496    0.359    0.758    0.334    0.934    1.000
5/4        0.557    0.834    0.226   -0.047   -0.139   -0.639   -0.416    1.000
5/6        0.350    0.449   -0.151   -0.425   -0.103   -0.608   -0.601    0.460    1.000
6/4       -0.382   -0.375    0.595    0.683    0.282    0.794    0.871   -0.294   -0.534    1.000
TABLE E-6

SCAPE VARIABLE °F0R*'fuLl!*imIC^^^ O'" LANDSAT AND LAND- 

This set of variables correctly classified B1.00 pw,Lm 4^00 TRAINING POINTS, 

shown because the upper and lower diagonals are 8wmmetri/.oi f the matrix is 

variables were added in a free stepwise fashion. The entry order is indictt^ h* ^ ■*’*"‘^* ancillary landscape 

CDP10 => median housing-unit value; CDP18 " average number of famiii ^ ‘“Patscrlpt numbers on the variable names. 

housing units per acre; SLOPE " topographic slooe- ELEVS = tonnn ^ili * “ average number of year-round 

distances. topographic slope, ELEVS = topographic elevation; and HI9 " built-up urban-area minimum 


Variables 

MSS-4 

MSS-7 

CDPIO 

CDP18 

CDP19 

SLOPE 

ELEVS 

HI9 

7/5 

5/4 


LOGO 

0.169 

-0.128 

0.069 

-0.057 

-0.035 

-0.042 

- 0.022 

-0.527 

0.557 



Variables I 

1^188-7“ 

CDPIO® 

CDP!8^ 

CDP19® 

SLOPED 

ELEVSi 

HI92 

m 

5/4^ 

1.000 









-0.015 

1.000 








0.059 

0.447 








•0.084 

-0.120 

0.016 







-0.026 

0.100 



1.000 





^0.057 

0.259 

0.166 


0.319 

1.000 




-0.041 

-0.030 

-0.076 

-0.196 

0.103 

0.099 




0.626 

0.055 

-0.011 

0.038 

-0.014 

-0.055 

lijH 

1.000 


-0.047 

-0.059 

0.014 

-0.140 

0.006 

0.064 

Hi 

-0.639 

1.000 
























